5 TRANSGENIC MICE CONTAINING RETINA-SPECIFIC NUCLEAR 

RECEPTOR GENE DISRUPTIONS 

Related Applications 

This application is a continuation-in-part to U.S. Application No. 60/190,348, filed 
10 March 16, 2000. 

Field of the Invention 

The present invention relates to transgenic animals, compositions and methods relating to 
the characterization of gene function. 

Background of the Invention 

U Normal growth and differentiation of all organisms is dependent on cells responding 

j s | correctly to a variety of internal and external signals. Many of these signals produce their effects 
\f* by ultimately changing the transcription of specific genes. One well-studied group of proteins 
20 that mediate a cell's response to a variety of signals is the family of transcription factors known 
i ? * j as nuclear receptors. Members of this group include receptors for steroid hormones, vitamin D, 
^ ecdysone, cis and trans retinoic acid, thyroid hormone, fatty acids (and other peroxisomal 
q proliferators), as well as so-called orphan receptors, proteins that are structurally similar to other 
r " members of this group, but for which no ligands are known. Orphan receptors may be indicative 
25 of unknown signaling pathways in the cell or may be nuclear receptors that function without 
ligand activation. There are indications that the activation of transcription by some of these 
orphan receptors may occur in the absence of an exogenous ligand and/or through signal 
transduction pathways originating from the cell surface. 

Steroid hormones affect the growth and function of specific cells by binding to 
30 intracellular receptors (SR) and forming SR-hormone complexes. SR-hormone complexes then 
interact with a hormone response element (HRE) in the control region of specific genes and alter 
specific gene expression. cDNAs for many SRs have been isolated and characterized, making it 
possible to deduce the amino acid sequences of various steroid/thyroid/retinoic acid receptors 
and related members of the super family of nuclear receptors (Evans et ah, Science, 240:889-895 



5 (1988); Liao et aU J- Steroid Biochem., 34:(l-6) 41-51 (1989); Forman et ai, New Biol, 2:(7) 
587-594 (1990)). 

The complete coding sequences for human (AF148128) and murine (AF148129; SEQ ID 
NO:21) retina-specific nuclear receptors were determined and published by Chen et al. (Proc. 
Natl Acad. Sci. U.S.A. 96(26), 15149-15154 (1999)). According to Chen et al, human RNR is a 
10 splice variant of PNR. Northern blot and reverse transcription-PCR analyses of human mRNA 
samples demonstrated that RNR is expressed exclusively in the retina, with transcripts of 
approximately 7.5 kb, approximately 3.0 kb, and approximately 2.3 kb by Northern blot analysis. 
In situ hybridization with multiple probes on both primate and mouse eye sections demonstrated 
that RNR is expressed in the retinal pigment epithelium and in Muller glial cells. By using the 
\5\ Gal4 chimeric receptor/reporter cotransfection system, the ligand binding domain of RNR was 
'jjt found to repress transcriptional activity in the absence of exogenous ligand. Gel mobility shift 
H assays revealed that RNR can interact with the promoter of the cellular retinaldehyde binding 
y j protein gene in the presence of retinoic acid receptor (RAR) and/or retinoid X receptor (RXR). 
?[ Given the importance of retinoic acid receptors in the regulation of gene expression, a 

2© clear need exists for further characterization of these receptors which can play a role in 

..... ^ 

li\ preventing, ameliorating or correcting dysfunctions or diseases. 

Q Summary of the Invention 

The present invention generally relates to transgenic animals, as well as to compositions 
25 and methods relating to the characterization of gene function. More specifically, the present 
invention relates to nucleic acid sequences encoding a retina-specific nuclear receptor and in the 
in vivo characterization of genes encoding a retina-specific nuclear receptor. 

The present invention provides transgenic cells comprising a disruption in retina-specific 
nuclear receptor gene. Preferably, the transgenic cells of the present invention are stem cells 
30 and more preferably, embryonic stem (ES) cells, and most preferably, murine ES cells. 
Preferably, the target gene's coding sequence (i.e., exons) comprises SEQ ID NO: 21. 
According to one embodiment, the transgenic cells are produced by introducing a targeting 
construct into a stem cell to produce a homologous recombinant, resulting in a disruption of the 
target sequence encoding a retina-specific nuclear receptor. In another embodiment, the 
35 transgenic cells are derived from the transgenic animals described below. 
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5 The present invention also provides a targeting construct and methods of producing the 

targeting construct that when introduced into stem cells produces a homologous recombinant 
generating transgenic cells comprising a disruption in a retina-specific nuclear receptor. In one 
embodiment, the targeting construct of the present invention comprises first and second 
polynucleotide sequences that are homologous to the target sequence. The targeting construct 
10 also comprises a polynucleotide sequence that encodes a positive selection marker that is 
preferably positioned between the two different homologous polynucleotide sequences in the 
construct. 

The present invention further provides non-human transgenic animals comprising a 
disruption in a retina-specific nuclear receptor gene and methods of producing such transgenic 
\S% animals. The transgenic animals of the present invention include transgenic animals that are 
;^ heterozygous and homozygous for a mutation in the gene that naturally encodes and expresses a 
fa* functional retina-specific nuclear receptor gene. In one aspect, the transgenic animals of the 
; 4 j present invention are defective in the function of the retina-specific nuclear receptor gene. The 

present invention also encompasses cells and cell lines derived from the transgenic animals of 
2% the present invention. 

;]l The transgenic animals of the present invention further comprise a phenotype associated 

l *[ with having a defect or disruption in a retina-specific nuclear receptor gene. 
□ The present invention also provides a method of identifying agents capable of affecting a 

phenotype of a transgenic animal. According to this method, a putative agent is administered to 
25 a transgenic animal. The response of the transgenic animal to the putative agent is then 
measured and compared to the response of a "normal" or wild type mouse, or alternatively 
compared to a transgenic animal control (without agent administration). The invention further 
provides agents identified according to such methods. 

The present invention further provides a method of identifying agents having an effect on 
30 retina-specific nuclear receptor gene expression or function. The method includes administering 
an effective amount of the agent to a transgenic animal, preferably a mouse, having a disruption 
in a retina-specific nuclear receptor gene. The method includes measuring a response of the 
transgenic animal, for example, to the agent, and comparing the response of the transgenic 
animal to a control mouse. The response of the transgenic animal as compared to the control 
35 mouse may serve as an indication of the specificity or activity of the agent. Compounds that 
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5 may have an effect on retina-specific nuclear receptor gene expression or function may also be 
screened against cells in cell-based assays, for example, to identify such compounds. 

The present invention also provides methods of identifying agents useful as therapeutic 
agents for treating conditions associated with a disruption in a retina-specific nuclear receptor 
gene. In a preferred embodiment, conditions include those associated with the phenotypes of the 

10 mice of the present invention. In accordance with this method, the present invention provides 
animal models useful in identifying compounds that are able to affect a phenotype, such as a 
physiological or behavioral phenotype associated with a disruption of a retina-specific nuclear 
receptor gene. The method involves, for example, administering a putative agent to a transgenic 
animal. The response of the transgenic animal to the putative agent is then measured and 

\5 r * compared to the response of a "normal" or wild-type mouse, or alternatively compared to a 

transgenic animal control (without agent administration). The invention further provides agents 

M= identified according to such methods. 

; The invention also provides cell lines comprising nucleic acid sequences encoding a 
m retina- specific nuclear receptor. Such cell lines may be capable of expressing such sequences by 
2& virtue of operable linkage to a promoter functional in the cell line. Preferably, expression of the 
7: ] sequence encoding a retina-specific nuclear receptor is under the control of an inducible 
^ promoter. Also provided are methods of identifying agents that interact with retina-specific 
p nuclear receptor, comprising the steps of contacting a retina-specific nuclear receptor with an 
r ~ agent and detecting an agent/retina-specific nuclear receptor complex. Such complexes can be 
25 detected by, for example, measuring expression of an operably linked detectable marker. 

The invention further provides methods of treating diseases or conditions associated with 
a disruption in a gene encoding a retina-specific nuclear receptor, and more particularly, to a 
disruption in the expression or function of a retina-specific nuclear receptor. In a preferred 
embodiment, methods of the present invention involve treating diseases or conditions associated 
30 with a disruption in retina-specific nuclear receptor expression or function, including 

administering to a subject in need, a therapeutic agent which effects retina-specific nuclear 
receptor expression or function. In accordance with this embodiment, the method comprises 
administration of a therapeutically effective amount of a natural, synthetic, semi-synthetic, or 
recombinant retina-specific nuclear receptor or fragment thereof as well as natural, synthetic, 
35 semi- synthetic or recombinant analogs. 
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5 The present invention further provides methods of treating diseases or conditions 

associated with disrupted retina-specific nuclear receptor expression or function, wherein the 
methods comprise detecting and replacing through gene therapy mutated retina-specific nuclear 
receptor genes. 

10 Definitions 

As used herein, "gene" refers to (a) a gene containing at least one of the DNA sequences 
disclosed herein; (b) any DNA sequence that encodes the amino acid sequence encoded by the 
DNA sequences disclosed herein and/or; (c) any DNA sequence that hybridizes to the 
complement of the coding sequences disclosed herein. Preferably, the term includes coding as 
15 well as noncoding regions, and preferably includes all sequences necessary for normal gene 
i j|. expression including promoters, enhancers and other regulatory sequences, 
i*** As used herein, "gene targeting" is a type of homologous recombination that occurs when 

H a fragment of genomic DNA is introduced into a mammalian cell and that fragment locates and 
ill recombines with endogenous homologous sequences. 

2t> "Disruption" of a target gene occurs when a fragment of genomic DNA locates and 

l y\ recombines with an endogenous homologous sequence such that production of the normal wild 

Ml 

M type gene product is inhibited or functionally disrupted, resulting in, for example, partial or 
complete loss of expression of a protein encoded by a target gene. Non-limiting examples of 

!«- disruption include insertion, missense, frameshift and deletion mutations. Gene targeting can also 
25 alter a promoter, enhancer, or splice site of a target gene to cause disruption, and can also involve 
replacement of a promoter with an exogenous promoter such as an inducible promoter described 
below. 

As used herein, a "transgenic animal" is an animal that contains within its genome a 
specific gene that has been disrupted or inactivated completely or partially by the method of gene 
30 targeting. The transgenic animal includes both the heterozygote animal (i.e. 9 one defective allele 
and one wild-type allele) and the homozygous animal (i.e., two defective alleles). 

The terms "polynucleotide" and "nucleic acid molecule" are used interchangeably to refer 
to polymeric forms of nucleotides of any length. The polynucleotides may contain 
deoxyribonucleotides, ribonucleotides and/or their analogs. Nucleotides may have any three- 
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5 dimensional structure, and may perform any function, known or unknown. The term 
"polynucleotide" includes single-, double-stranded and triple helical molecules. 

"Oligonucleotide" refers to polynucleotides of between 5 and about 100 nucleotides of 
single- or double-stranded DNA. Oligonucleotides are also known as oligomers or oligos and 
may be isolated from genes, or chemically synthesized by methods known in the art. A "primer" 
10 refers to an oligonucleotide, usually single-stranded, that provides a 3-hydroxyl end for the 
initiation of enzyme-mediated nucleic acid synthesis. 

The following are non-limiting embodiments of polynucleotides: a gene or gene 
fragment, exons, introns, mRNA, tRNA, rRNA, ribozymes, cDNA, recombinant 
polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, 
15 isolated RNA of any sequence, nucleic acid probes and primers. A nucleic acid molecule may 
;.f\ also comprise modified nucleic acid molecules, such as methylated nucleic acid molecules and 

nucleic acid molecule analogs. Analogs of purines and pyrimidines are known in the art, and 
H include, but are not limited to, aziridinycytosine, 4-acetylcytosine, 5-fluorouracil, 5-bromouracil, 
m S-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethyl-aminomethyluracil, inosine, N6- 
20 isopentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 
O 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, 
i,^ pseudouracil, 5-pentylnyl uracil and 2,6-diaminopurine. The use of uracil as a substitute for 
% thymine in a deoxyribonucleic acid is also considered an analogous form of pyrimidine. 

A "fragment" of a polynucleotide is a polynucleotide comprised of at least 9 contiguous 
25 nucleotides, preferably at least 15 contiguous nucleotides and more preferably at least 45 
nucleotides, of coding or non-coding sequences. 

As used herein, "base pair," also designated "bp," refers to the complementary nucleic 
acid molecules. In DNA there are four "types" of bases: the purine base adenine (A) is hydrogen 
bonded with the pyrimidine base thymine (T), and the purine base guanine (G) with the 
30 pyrimidine base cytosine (C). Each hydrogen bonded base pair set is also known as a Watson- 
Crick base-pair. A thousand base pairs is often called a kilobase pair, or kb. A "base pair 
mismatch" refers to a location in a nucleic acid molecule in which the bases are not 
complementary Watson-Crick pairs. The phrase "does not include at least one type of base at 
any position" refers to a nucleotide sequence which does not have one of the four bases at any 
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5 position. For example, a sequence lacking one nucleotide (i.e., lacking one type of base) could 
be made up of A, G, T base pairs and contain no C residues. 

As used herein, the term "construct" refers to an artificially assembled DNA segment to 
be transferred into a target tissue, cell line or animal, including human. Typically, the construct 
will include the gene or a sequence of particular interest, a marker gene and appropriate control 

10 sequences. The term "plasmid" refers to an autonomous, self-replicating extrachromosomal 
DNA molecule. In a preferred embodiment, the plasmid construct of the present invention 
contains a positive selection marker positioned between two flanking regions of the gene of 
interest. Optionally, the construct can also contain a screening marker, for example, green 
fluorescent protein (GFP). If present, the screening marker is positioned outside of and some 

lj5. % distance away from the flanking regions, 

W The term "polymerase chain reaction" or "PCR" refers to a method of amplifying a DNA 

base sequence using a heat-stable polymerase such as Taq polymerase, and two oligonucleotide 

!7s primers; one complementary to the (n-)-strand at one end of the sequence to be amplified and the 
other complementary to the (-)-strand at the other end. Because the newly synthesized DNA 

20 strands can subsequently serve as additional templates for the same primer sequences, successive 
J" i rounds of primer annealing, strand elongation, and dissociation produce exponential and highly 
M specific amplification of the desired sequence. PCR also can be used to detect the existence of 
q the defined sequence in a DNA sample. "Long-range" refers to PCR conditions which allow 
f "~ amplification of large nucleotides stretches, for example, greater than 1 kb. 

25 As used herein, the term "positive selection marker" refers to a gene encoding a product 

that enables only the cells that carry the gene to survive and/or grow under certain conditions. 
For example, plant and animal cells that express the introduced neomycin resistance (Neo r ) gene 
are resistant to the compound G41 8. Cells that do not carry the Neo r gene marker are killed by 
G418. Other positive selection markers will be known to those of skill in the art. 

30 "Positive-negative selection" refers to the process of selecting cells that carry a DNA 

insert integrated at a specific targeted location (positive selection) and also selecting against cells 
that carry a DNA insert integrated at a non-targeted chromosomal site (negative selection). Non- 
limiting examples of negative selection inserts include the gene encoding thymidine kinase (tk). 
Genes suitable for positive-negative selection are known in the art, see e.g., U.S. Patent 

35 5,464,764. 
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5 "Screening marker" or "reporter gene" refers to a gene that encodes a product that can 

readily be assayed. For example, reporter genes can be used to determine whether a particular 
DNA construct has been successfully introduced into a cell, organ or tissue. Non-limiting 
examples of screening markers include genes encoding for green fluorescent protein (GFP) or 
genes encoding for a modified fluorescent protein. "Negative screening marker" is not to be 
10 construed as negative selection marker; a negative selection marker typically kills cells that 
express it. 

The term "vector" refers to a DNA molecule that can carry inserted DNA and be 
perpetuated in a host cell. Vectors are also known as cloning vectors, cloning vehicles or 
vehicles. The term includes vectors that function primarily for insertion of a nucleic acid 
15 r molecule into a cell, replication vectors that function primarily for the replication of nucleic acid, 

I and expression vectors that function for transcription and/or translation of the DNA or RNA. 
\„l Also included are vectors that provide more than one of the above functions. In a preferred 

embodiment, the vector contains sites useful in the methods described herein, for example, the 
□1 vectors M pDG2" or "pDG4" as described herein. 

ig A "host cell" includes an individual cell or cell culture which can be or has been a 

!;;{ recipient for vector(s) or for incorporation of nucleic acid molecules and/or proteins. Host cells 
M include progeny of a single host cell, and the progeny may not necessarily be completely 
jpl identical (in morphology or in total DNA complement) to the original parent due to natural, 
accidental, or deliberate mutation. A host cell includes cells transfected with the constructs of 
25 the present invention. 

The term "genomic library" refers to a collection of clones made from a set of randomly 
generated overlapping DNA fragments representing the genome of an organism. A "cDNA 
library" (complementary DNA library) is a collection of mRNA molecules present in a cell, 
tissue, or organism, turned into cDNA molecules with the enzyme reverse transcriptase, then 
30 inserted into vectors (other DNA molecules which can continue to replicate after addition of 
foreign DNA). Exemplary vectors for libraries include bacteriophage (also known as "phage"), 
which are viruses that infect bacteria, for example lambda phage. The library can then be probed 
for the specific cDNA (and thus mRNA) of interest. In one embodiment, library systems which 
combine the high efficiency of a phage vector system with the convenience of a plasmid system 
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5 (for example, ZAP system from Stratagene, La Jolla, CA) are used in the practice of the present 
invention. 

The term "homologous recombination" refers to the exchange of DNA fragments 
between two DNA molecules or chromatids at the site of homologous nucleotide sequences, i.e., 
those sequences preferably having at least about 70 percent sequence identity, typically at least 
10 about 85 percent identity, and preferably at least about 90 percent identity. Homology can be 
determined using a "BLASTN" algorithm. It is understood that homologous sequences can 
accommodate insertions, deletions and substitutions in the nucleotide sequence. Thus, linear 
sequences of nucleotides can be essentially identical even if some of the nucleotide residues do 
not precisely correspond or align. 
15 As used herein the term "ligation-independent cloning" is used in the conventional sense 

1 1| to refer to incorporation of a DNA molecule into a vector or chromosome without the use of 
^ kinases or ligases. Ligation-independent cloning techniques are described, for instance, in 
¥\ Aslanidis & de Jong, Nucleic Acids Res., 18:6069-74 and U.S. Patent Application Serial 
m No. 07/847,298 (1991). 

2 ! 0 " As used herein, the term "target sequence" (alternatively referred to as "target gene 

□ sequence" or "target DNA sequence") refers to the nucleic acid molecule with any 
Uk polynucleotide having a sequence in the general population that is not associated with any 
% disease or discernible phenotype. It is noted that in the general population, wild-type genes may 
K fe include multiple prevalent versions that contain alterations in sequence relative to each other and 
25 yet do not cause a discernible pathological effect. These variations are designated 
"polymorphisms" or "allelic variations." 

In a preferred embodiment, the target DNA sequence comprises a portion of a particular 
gene or genetic locus in the individual's genomic DNA. The target DNA sequence encodes a 
retina-specific nuclear receptor. According to one embodiment, the target DNA comprises part 
30 of a particular gene or genetic locus in which the function of the gene product is not known, for 
example, a gene identified using a partial cDNA sequence such as an EST. The target retina- 
specific nuclear receptor gene comprises the coding sequence represented by SEQ ID NO:21 or a 
naturally occurring allelic variation or homologue of the target gene. 

The term "exonuclease" refers to an enzyme that cleaves nucleotides sequentially from 
35 the free ends of a linear nucleic acid substrate. Exonucleases can be specific for double or 
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5 single-stranded nucleotides and/or directionally specific, for instance, 3-5' and/or 5-3 Some 
exonucleases exhibit other enzymatic activities, for example, T4 DNA polymerase is both a 
polymerase and an active 3 -5' exonuclease. Other exemplary exonucleases include exonuclease 
EI which removes nucleotides one at a time from the 5 -end of duplex DNA which does not have 
a phosphorylated 3 -end, exonuclease VI which makes oligonucleotides by cleaving nucleotides 
10 off of both ends of single-stranded DNA, and exonuclease lambda which removes nucleotides 
from the 5' end of duplex DNA which have 5 -phosphate groups attached to them. 

The term "recombinase" encompasses enzymes that induce, mediate or facilitate 
recombination, and other nucleic acid modifying enzymes that cause, mediate or facilitate the 
rearrangement of a nucleic acid sequence, or the excision or insertion of a first nucleic acid 
15 sequence from or into a second nucleic acid sequence. The "target site" of a recombinase is the 
; : r| nucleic acid sequence or region that is recognized (e.g., specifically binds to) and/or acted upon 

(excised, cut or induced to recombine) by the recombinase. As used herein, the expression 
H "enzyme-directed site-specific recombination" is intended to include the following three events: 

1. deletion of a pre-selected DNA segment flanked by recombinase target sites; 
28* 2. inversion of the nucleotide sequence of a pre-selected DNA segment flanked by 

Q recombinase target sites; and 

3. reciprocal exchange of DNA segments proximate to recombinase target sites located 
^: on different DNA molecules. 



25 Brief Description of the Drawings 

Figure 1 is a schematic depicting one method of constructing a targeting vector of the 
present invention. The plasmid PCR method is described in Examples 9 and 10. 

Figure 2A is a schematic depicting the pDG2 vector. The vector contains an ampicillin 
resistance gene and a neomycin (Neo r ) gene. On each side of the Neo r gene are two sites for 
30 ligation-independent cloning along with restriction sites. The sequence of pDG2 is shown in 
Figure 2B and SEQ ID NO: 1 . 

Figure 3A is schematic depicting the pDG4 vector. The vector contains an ampicillin 
resistance gene, a neomycin (Neo r ) gene and a green fluorescent protein (GFP) gene. On each 
side of the Neo r gene are two sites for ligation-independent cloning along with restriction 
35 enzyme recognition sites. The sequence of pDG4 is shown in Figure 3B and SEQ ID NO:2. 



10 



Figure 4 (SEQ E) N0:3 through SEQ ID NO: 10) shows the nucleic acid sequence before 
and after T4 DNA polymerase treatment of annealing sites 1-4 contained on the ends of PCR- 
amplified genomic DNA. 

Figure 5 (SEQ ID NO: 11 through SEQ ID NO: 18) shows the nucleic acid sequence 
before and after T4 DNA polymerase treatment of annealing site 1-4 contained within the pDG2 
vector. 

Figure 6 shows the arrangement of 5' and 3' flanking DNA relative to annealing sites 1, 2, 
3 and 4 within the pDG2 vector during an annealing reaction. 

Figure 7 shows the arrangement of 5' and 3 ' flanking DNA relative to annealing sites 1, 2, 
3 and 4 and the GFP screening marker within the pDG4 vector during an annealing reaction. 

Figure 8 shows the polynucleotide sequence identified as SEQ ID NO: 19. Figure 8 also 
shows the sequences identified as SEQ ID NO: 20 and SEQ ID NO:21, which were used in the 
retina-specific nuclear receptor gene targeting construct. 

Detailed Description of the Invention 

The invention is based, in part, on the evaluation of the expression and role of genes and 
gene expression products, primarily those associated with retina-specific nuclear receptor. 
Among others, the invention permits the definition of disease pathways and the identification of 
diagnostically and therapeutically useful targets. For example, genes which are mutated or 
down-regulated under disease conditions may be involved in causing or exacerbating the disease 
condition. Treatments directed at up-regulating the activity of such genes or treatments which 
involve alternate pathways, may ameliorate the disease condition. 

Any technique known in the art may be used to introduce a target gene transgene into 
animals to produce the founder lines of transgenic animals. Such techniques include, but are not 
limited to pronuclear microinjection (U.S. Pat. No. 4,873,191); retrovirus mediated gene transfer 
into germ lines (Van der Putten, et al, Proc. Natl Acad. ScL 9 USA, 82:6148-6152 (1985)); gene 
targeting in embryonic stem cells (Thompson, etal, Cell, 56:313-321 (1989)); electroporation of 
embryos (Lo, Mol Cell Biol., 3:1803-1814 (1983)); and sperm-mediated gene transfer 
(Lavitrano, et al, Cell, 57:717-723 (1989)); etc. For a review of such techniques, see Gordon, 
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5 Transgenic Animals, Intl. Rev, Cytol., 115:171-229 (1989), which is incorporated by reference 
herein in its entirety. 

In a preferred embodiment, homologous recombination is used to generate the knockout 
mice of the present invention. Preferably, the construct is generated in two steps by 
(1) amplifying (for example, using long-range PCR) sequences homologous to the target 

10 sequence, and (2) inserting another polynucleotide (for example a selectable marker) into the 
PCR product so that it is flanked by the homologous sequences. Typically, the vector is a 
plasmid from a plasmid genomic library. The completed construct is also typically a circular 
plasmid. Thus, as shown in Figure 1, using long-range PCR with "outwardly pointing' 1 
oligonucleotides results in a vector into which a selectable marker can easily be inserted, 

15* preferably by ligati on-independent cloning. The construct can then be introduced into ES cells, 

41 where it can disrupt the function of the homologous target sequence. 

Ui, Homologous recombination may also be used to knockout genes in stem cells, and other 

I ^ cell types, which are not totipotent embryonic stem cells. By way of example, stem cells may be 
IP myeloid, lymphoid, or neural progenitor and precursor cells. Such transgenic cells may be 
2Q particularly useful in the study of target gene function in individual developmental pathways. 
Stem cells may be derived from any vertebrate species, such as mouse, rat, dog, cat, pig, rabbit, 
M human, non-human primates and the like. 

p In cells which are not totipotent it may be desirable to knock out both copies of the target 

■ 8S * using methods which are known in the art. For example, cells comprising homologous 
25 recombination at a target locus which have been selected for expression of a positive selection 
marker (e.g., Neor) and screened for non-random integration, can be further selected for multiple 
copies of the selectable marker gene by exposure to elevated levels of the selective agent (e.g., 
G418). The cells are then analyzed for homozygosity at the target locus. Alternatively, a second 
construct can be generated with a different positive selection marker inserted between the two 
30 homologous sequences. The two constructs can be introduced into the cell either sequentially or 
simultaneously, followed by appropriate selection for each of the positive marker genes. The 
final cell is screened for homologous recombination of both alleles of the target. 

In another aspect, two separate fragments of a clone of interest are amplified and inserted 
into a vector containing a positive selection marker using ligation-independent cloning 
35 techniques. In this embodiment, the clone of interest is generally from a phage library and is 



5 identified and isolated using PCR techniques. The ligation-independent cloning can be 
performed in two steps or in a single step. 

According to a preferred method, constructs are used having multiple sites where 5-3' 
single-stranded regions can be created. These constructs, preferably plasmids, include a vector 
capable of directional, four-way ligation-independent cloning. 
10 The constructs typically include a sequence encoding a positive selection marker such as 

a gene encoding neomycin resistance; a restriction enzyme site on either side of the positive 
selection marker and a sequence flanking the restriction enzyme sites which does not contain one 
of the four base pairs. This configuration allows single-stranded ends to be created in the 
sequence by digesting the construct with the appropriate restriction enzyme and treating the 
i£% fragments with a compound having exonuclease activity, for example T4 DNA polymerase. 
■*;U S In one preferred embodiment, a construct suitable for introducing targeted mutations into 

|«i ES cells is prepared directly from a plasmid genomic library. Using long-range PCR with 
; . g specific primers, a sequence of interest is identified and isolated from the plasmid library in a 
'f* single step. Following isolation of this sequence, a second polynucleotide that will disrupt the 
2Q target sequence can be readily inserted between two regions encoding the sequence of interest. 
'? \ Using this direct method a targeted construct can be created in as little as 72 hours. In another 
H embodiment, a targeted construct is prepared after identification of a clone of interest in a phage 
q genomic library as described in detail below. 

! 5 " The methods described herein obviate the need for hybridization isolation, restriction 

25 mapping and multiple cloning steps. Moreover, the function of any gene can be determined 
using these methods. For example, a short sequence (e.g., EST) can be used to design 
oligonucleotide probes. These probes can be used in the direct amplification procedure to create 
constructs or can be used to screen genomic or cDNA libraries for longer full-length genes. 
Thus, it is contemplated that any gene can be quickly and efficiently prepared for use in ES cells. 
30 In a preferred embodiment, constructs are prepared directly from a plasmid genomic 

library. The library can be produced by any method known in the art. Preferably, DNA from 
mouse ES cells is isolated and treated with a restriction endonuclease which cleaves the DNA 
into fragments. The DNA fragments are then inserted into a vector, for example a bacteriophage 
or phagemid (e.g., Lamda ZAP™, Stratagene, La Jolla, CA) systems. When the library is 
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5 created in the ZAP™ system, the DNA fragments are preferably between about 5 and about 20 
kilobases. 

Preferably, the organism(s) from which the libraries are made will have no discernible 
disease or phenotypic effects. Preferably, the library is a mouse library. This DNA may be 
obtained from any cell source or body fluid. Non-limiting examples of cell sources available in 
10 clinical practice include ES cells, liver, kidney, blood cells, buccal cells, cerviovaginal cells, 
epithelial cells from urine, fetal cells, or any cells present in tissue obtained by biopsy. Body 
fluids include urine, blood cerebrospinal fluid (CSF), and tissue exudates at the site of infection 
or inflammation. DNA extracted from the cells or body fluid using any method known in the art. 
Preferably, the DNA is extracted by adding 5 ml of lysis buffer (10 mM Tris-HCl pH 7.5), 
15 % 10 mM EDTA (pH 8.0), 10 mM NaCl, 0.5% SDS and 1 mg/ml Proteinase K) to a confluent 100 
f 4! mm plate of embryonic stem cells. The cells are then incubated at about 60°C for several hours 

or until fully lysed. Genomic DNA is purified from the lysed cells by several rounds of gentle 
f^i phenol: chloroform extraction followed by an ethanol precipitation. For convenience, the 
|J S genomic library can be arrayed into pools. 

2Q In a preferred embodiment, a sequence of interest is identified from the plasmid library 

!" s using oligonucleotide primers and long-range PCR. Typically, the primers are outwardly- 
H pointing primers which are designed based on sequence information obtained from a partial gene 
□ sequence, e.g., a cDNA or an EST sequence. As depicted for example in Figure 1, the product 
r "' will be a linear fragment that excludes the region which is located between each primer. 
25 PCR conditions found to be suitable are described below in the Examples. It will be 

understood that optimal PCR conditions can be readily determined by those skilled in the art. 
(See, e.g., PCR 2: A Practical Approach (1995) eds. MJ. McPherson, B.D. Hames and G.R. 
Taylor, IRL Press, Oxford; Yu, et al, Methods Mol Bio., 58:335-9 (1996); Munroe, et al, Proc. 
Nat'lAcad. ScL, USA, 92:2209-13 (1995)). PCR screening of libraries eliminates many of the 
30 problems and time-delay associated with conventional hybridization screening in which the 
library must be plated, filters made, radioactive probes prepared and hybridization conditions 
established. PCR screening requires only oligonucleotide primers to sequences (genes) of 
interest. PCR products can be purified by a variety of methods, including but not limited to, 
microfiltration, dialysis, gel electrophoresis and the like. It may be desirable to remove the 
35 polymerase used in PCR so that no new DNA synthesis can occur. Suitable thermostable DNA 
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5 polymerases are commercially available, for example, Vent™ DNA Polymerase (New England 
Biolabs), Deep Vent™ DNA Polymerase (new England Biolabs), HotTub™ DNA Polymerase 
(Amersham), Thermo Sequenase™ (Amersham), rBst™ DNA Polymerase (Epicenter), Pfu™ 
DNA Polymerase (Stratagene), Amplitaq Gold™ (Perkin Elmer), and Expand™ (Boehringer- 
Mannheim). 

10 To form the completed construct, a sequence which will disrupt the target sequence is 

inserted into the PCR-amplified product. For example, as described herein, the direct method 
involves joining the long-range PCR product (Le. f the vector) and one fragment (Le,, a gene 
encoding a selectable marker). As discussed above, the vector contains two different sequence 
regions homologous to the target DNA sequence. Preferably, the vector also contains a sequence 
i£\ encoding a selectable marker, such as ampicillin. The vector and fragment are designed so that, 
%\ when treated to form single stranded ends, they will anneal such that the fragment is positioned 
H between the two different regions of substantial homology to the target gene, 
jjj Although any method of cloning is suitable, it is preferred that ligation-independent 

cloning strategies be used to assemble the construct comprising two different homologous 
20 regions flanking a selectable marker. Ligation-independent cloning (LIC) is a strategy for the 
ij ! directional cloning of polynucleotides without the use of kinases or ligases. (See, e.g., Aslanidis 
V et al, Nucleic Acids Res., 18:6069-74 (1990); Rashtchian, Current Opin. Biotech., 6:30-36 

Ma 

O (1995)). Single-stranded tails (also referred to as cloning sites or annealing sequences) are 
created in LIC vectors, usually by treating the vector (at a digested restriction enzyme site) with 

25 T4 DNA polymerase in the presence of only one dNTP. The 3' to 5' exonuclease activity of T4 
DNA polymerase removes nucleotides until it encounters a residue corresponding to the single 
dNTP present in the reaction mix. At this point, the 5' to 3' polymerase activity of the enzyme 
counteracts the exonuclease activity to prevent further excision. The vector is designed such that 
the single-stranded tails created are non-complementary. For example, in the pDG2 vector, none 

30 of the single-stranded tails of the four annealing sites are complementary to each other. PCR 
products are created by building appropriate 5' extensions into oligonucleotide primers. The 
PCR product is purified to remove dNTPs (and original plasmid if it was used as template) and 
then treated with T4 DNA polymerase in the presence of the appropriate dNTP to generate the 
specific vector-compatible overhangs. Cloning occurs by annealing of the compatible tails. 

35 Single-stranded tails are created at the ends of the clone fragments, for example using chemical 
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5 or enzymatic means. Complementary tails are created on the vector; however, to prevent 

annealing of the vector without insert, the vector tails are not complementary to each other. The 
length of the tails is at least about 5 nucleotides, preferably at least about 12 nucleotides, even 
more preferably at least about 20 nucleotides. 

In one embodiment, placing the overlapping vector and fragment(s) in the same reaction 
10 is sufficient to anneal them. Alternatively, the complementary sequences are combined, heated 
and allowed to slowly cool. Preferably the heating step is between about 60°C and about 100°C, 
more preferably between about 60°C and 80°C, and even more preferably between 60°C and 
70°C The heated reactions are then allowed to cool. Generally, cooling occurs rather slowly, 
for instance the reactions are generally at about room temperature after about an hour. The 
lSr\ cooling must be sufficiently slow as to allow annealing. The annealed fragment/vector can be 
X: used immediately, or stored frozen at -20°C until use. 

J 5! ' s Further, annealing can be performed by adjusting the salt and temperature to achieve 

uj suitable conditions. Hybridization reactions can be performed in solutions ranging from about 
:\ 10 mM NaCl to about 600 mM NaCl, at temperatures ranging from about 37°C to about 65°C, 
20 It will be understood that the stringency of the hybridization reaction is determined by both the 
uj salt concentration and the temperature. For instance, a hybridization performed in 10 mM salt at 

37°C may be of similar stringency to one performed in 500 mM salt at 65 °C. For the present 
CI invention, any hybridization conditions may be used that form hybrids between homologous 
complementary sequences. 
25 As shown in Figure 1, in one embodiment, a construct is made after using any of these 

annealing procedure where the vector portion contains the two different regions of substantial 
homology to the target gene (amplified from the plasmid library using long-range PCR) and the 
fragment is a gene encoding a selectable marker. 

After annealing, the construct is transformed into competent E. coli cells by methods 
30 known in the art, to amplify the construct. The isolated construct is then ready for introduction 
into ES cells. 

In another embodiment, a clone of interest is identified in a pooled genomic library using 
PCR. In one embodiment, the PCR conditions are such that a gene encoding a selectable marker 
can be inserted directly into the positively identified clone. The marker is positioned between 
35 two different sequences having substantial homology to the target DNA. 
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5 Genomic phage libraries can be prepared by any method known in the art. Preferably, a 

mouse embryonic stem cell library is prepared in lambda phage by cleaving genomic DNA into 
fragments of approximately 20 kilobases in length. The fragments are then inserted into any 
suitable lambda cloning vector, for example lambda Fix II or lambda Dash H (Stratagene, La 
Jolla, Ca) 

10 In order to quickly and efficiently screen a large number of clones from a library, pools 

may be created of plated libraries. In a preferred embodiment, a genomic lambda phage library 
is plated at a density of approximately 1,000 clones (plaques) per plate. Sufficient plates are 
created to represent the entire genome of the organism several times over. For example, 
approximately 1 million clones (1000 plates) will yield approximately 8 genome equivalents. 
15 a The plaques are then collected, for example by overlaying the plate with a buffer solution, 
?,|| incubating the plates and recollecting the buffer. The amount of buffer used will vary according 
to the plate size, generally one 100 mm diameter plate will be overlayed with approximately 4 ml 
y \\ of buffer and approximately 2 ml will be collected. 

i;m It will be understood that the individual plate lysates can be pooled at any time during 

2$ this procedure and that they can be pooled in any combinations. For ease in later identification 
] ~i of single clones, however, it is preferable to keep each plate lysate separately and then make a 
{*** pool. For example, each 2 ml lysate can be placed in a 96 well deep well plate. Pools can then 
%l be formed by taking an amount, preferably about 100 Dl, from each well and combining them in 
^ h the well of a new plate. Preferably, 100 Dl of 12 individual plate lysates are combined in one 
25 well, forming a 1.2 ml pool representative of 12,000 clones of the library. 

Each pool is then PCR-amplified using a set of PCR primers known to amplify the target 
gene. The target gene can be a known full-length gene or, more preferably, a partial cDNA 
sequence obtained from publicly available nucleic acid sequence databases such as GenBank or 
EMBL. These databases include partial cDNA sequences known as expressed sequence tags 
30 (ESTs). The oligonucleotide PCR primers can be isolated from any organism by any method 
known in the art or, preferably, synthesized by chemical means. 

Once a positive clone of the target gene has been identified in a genomic library, two 
fragments encoding separate portions of the target gene must be generated. In other words, the 
flanking regions of the small known region of the target (e.g., EST) are generated. Although the 
35 size of each flanking region is not critical and can range from as few as 100 base pairs to as 
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5 many as 100 kb, preferably each flanking fragment is greater than about 1 kb in length, more 
preferably between about 1 and about 10 kb, and even more preferably between about 1 and 
about 5 kb. One of skill in the art will recognize that although larger fragments may increase the 
number of homologous recombination events in ES cells, larger fragments will also be more 
difficult to clone. 

10 In one embodiment, one of the oligonucleotide PCR primers used to amplify a flanking 

fragment is specific for the library cloning vector, for example lambda phage. Therefore, if the 
library is a lambda phage library, primers specific for the lambda phage arms can be used in 
conjunction with primers specific for the positive clone to generate long flanking fragments. 
Multiple PCR reactions can be set up to test different combinations of primers. Preferably, the 
15 primers used will generate flanking sequences between about 2 and about 6 kb in length. 
U [S Preferably, the oligonucleotide primers are designed with 5' sequences complementary to 

the vector into which the fragments will be cloned. In addition, the primers are also designed so 
^ that the flanking fragments will be in the proper 3-5' orientation with respect to the vector and 
r|! each other when the construct is assembled. 
20 " Thus, using PCR-based methods, for example, positive clones can be identified by 

Q visualization of a band on an electrophoretic gel. 

u In one aspect, the cloning involves a vector and two fragments. The vector contains a 

positive selection marker, preferably Neo r , and cloning sites on each side of the positive selection 
\*' k marker for two different regions of the target gene. Optionally, the vector also contains a 
25 sequence coding for a screening marker (reporter gene), preferably, positioned opposite the 

positive selection marker. The screening marker will be positioned outside the flanking regions 
of homologous sequences. Figure 3 A shows one embodiment of the vector with the screening 
marker, GFP, positioned on one side of the vector. However, the screening marker can be 
positioned anywhere between Not I and Site 4 on the side opposite the positive selection marker, 
30 Neo r . 

One example of a suitable vector is the plasmid vector shown in Figure 2 having the 
sequence of SEQ ID NO:l. The specific nucleic acid ligati on-independent cloning sites (also 
referred to herein as annealing sites) labeled "sites 1, 2, 3 or 4" in Figure 1 are also shown herein. 
Generally, the cloning sites are lacking at least one type of base, i.e., thymine (T), guanine (G), 
35 cytosine (C) or adenine (A). Accordingly, reacting the vector with an enzyme that acts as both a 
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5 polymerase and exonuclease in presence of only the one missing nucleotide will create an 

overhang. For example, T4 DNA polymerase acts as both a 3 -5' exonuclease and a polymerase. 
Thus, when there are insufficient nucleotides available for the polymerase activity, T4 will act as 
an exonuclease. Specific overhangs can therefore be created by reacting the pDG2 vector with 
T4 DNA polymerase in the presence of dTTP only. Other enzymes useful in the practice of this 
10 invention will be known to those in the art, for instance uracil DNA glycosylase (UDG) (See, 
e.g., WO 93/18175). The vector exemplified herein has an overhand of 24 nucleotides. It will 
be known by those skilled in the art that as few as 5 nucleotides are required for successful 
ligation independent cloning. 

In another embodiment, a construct is assembled in a two-step cloning protocol. In the 
15. first step, each cloning region of homology is separately cloned into two of the annealing sites of 
the vector. For example, an "upstream" region of homology is cloned into annealing sites 1 and 
Ci 2 while a separate cloning, a "downstream" region of homology is cloned into annealing sites 3 
f " and 4. Once clones containing each single region of homology are identified, a targeting 

construct containing both regions of homology can be created by digesting each clone with 
2Q restriction enzymes where one enzyme digests outside of annealing site 1 (e.g., Not I in 

Figure 2 A) and another enzyme digests between the positive selection marker and annealing site 
M : 3 (e.g., Sal I in Figure 2A). The fragments containing the flanking homology regions from each 
r-^ construct will be purified (e.g., by gel electrophoresis) and combined using standard ligation 
jMB techniques known in the art, to produce the resulting targeting construct. 
25 In yet another embodiment, a construct according to one aspect of the present invention 

can be formed in a single-step, four-way ligation procedure. The vector and fragments are 
treated as described above. Briefly, the vector is treated to form two pieces, each piece having a 
single-stranded tail of specific sequence on each end. Likewise, the PCR-amplified flanking 
fragments are also treated to form single-stranded tails complementary to those of the vector 
30 pieces. The treated vector pieces and fragments are combined and allowed to anneal as 

described above. Because of the specificity of the single-stranded tails, the final construct will 
contain the fragments separated by the positive selection marker in the proper orientation. 

The final plasmid constructs are amplified in bacteria, purified and can then be 
introduced into ES cells, or stored frozen at -20°C until use. Where so desired, the vector is 
35 introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
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5 introduced DNA has homologously recombined with the endogenous DNA are selected (see e.g., 
Li, et al, Cell, 69:91526 (1992)). Successful recombination may be verified using various 
techniques known in the art, such as PCR and/or Southern analysis. Typically, several hundred 
individual colonies are selected following drug selection in G418 (for Neo cassettes), expanded 
for DNA preparation and screened for homologous recombination by PCR analysis. The PCR 
10 screening procedure uses a target gene specific oligonucleotide that is not present on the 

targeting vector and an oligonucleotide corresponding to the Neo (or other selectable marker) 
cassette. The selection of oligonucleotides outside the targeting vector is used to differentiate 
homologous recombinants from random integrations of the targeting vector. In general, four 
independent target gene specific oligonucleotides not present on the targeting vector are tested 
IS, on wild type ES cell DNA in combination with target gene specific oligonucleotides that are 
! 4f adjacent to the insertion site of the Neo (Figure 9). Oligonucleotides producing background 
u bands or failing to give the predicted size product are eliminated. A single target gene specific 
\ 2 | oligonucleotide is selected and paired with an oligonucleotide corresponding to the Neo cassette. 
; bM ES cells that are PCR positive in this screen are confirmed by a second PCR experiment that 
20 utilizes a different pair of target gene specific and Neo gene (or other selectable marker) specific 
i*= I oligonucleotides that are adjacent to, but distinct from, the original oligonucleotide pair. In 

addition, this protocol may be repeated using oligonucleotides specific for target gene sequences 
□ located on the opposite side of the selectable marker in conjunction with a marker specific 
r " oligonucleotide. In this way proper integration of both homologous sequences of the targeting 
25 vector is verified. 

Southern blot hybridization may also be used to confirm the ES cell targeting event using 
a probe that is not contained on the targeting vector but is adjacent to the predicted crossover site 
of homologous recombination. Southern blot experiments testing for homologous recombination 
should detect two distinct bands representing the wild type chromosome and mutant gene 
30 targeted allele. High molecular weight genomic DNA is prepared from control ES cell parental 
lines and ES cell lines that are PCR positive for homologous recombination. The DNA is 
digested with a restriction enzyme (EcoRl) that has been demonstrated by restriction mapping to 
not cut the targeting vector within the arm of the target gene DNA homology and to be 
diagnostic of homologous recombination. As an EcoRl site is present in the Neo gene, a 
35 homologous recombination event should result in the insertion of the Neo cassette and the 
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5 addition of the EcoRl site. The addition of this site is predicted to result in an overall reduction 
in size of the band hybridizing to the probe. The digested DNA is separated on a 1% TAE 
Agarose gel, transferred to a nylon membrane, crosslinked with a UV light (StrataLinker) and 
hybridized with a 32-P labeled DNA probe. This probe does not hybridize to DNA sequences 
that are on the targeting vector but to a position that is adjacent to the site of homologous 
10 integration. 

Selected cells are then injected into a blastocyst (or other stage of development suitable 
for the purposes of creating a viable animal, such as, for example, a morula) of an animal (e.g., a 
mouse) to form chimeras (see e.g., Bradley, A. in Teratocarcinomas and Embryonic Stem Cells: 
A Practical Approach, E. J. Robertson, e&, IRL, Oxford, pp. 113-152 (1987)). Alternatively, 
15 selected ES cells can be allowed to aggregate with dissociated mouse embryo cells to form the 
uli aggregation chimera. A chimeric embryo can then be implanted into a suitable pseudopregnant 
f [ female foster animal and the embryo brought to term. Chimeric progeny harbouring the 
Y\ homologously recombined DNA in their germ cells can be used to breed animals in which all 
q! cells of the animal contain the homologously recombined DNA. In one embodiment, chimeric 
26 progeny mice are used to generate a mouse with a heterozygous disruption in the target gene. 
;^ Heterozygous knockout mice can then be mated. It is well know in the art that typically l A of the 

offspring of such matings will have a homozygous disruption in the target gene. 
%l The heterozygous and homozygous knockout mice can then be compared to normal, wild 

N type mice to determine whether disruption of the target gene causes phenotypic changes, 
25 especially pathological changes. For example, heterozygous and homozygous mice may be 
evaluated for phenotypic changes by physical examination, necropsy, histology, clinical 
chemistry, complete blood count, body weight, organ weights, and cytological evaluation of 
bone marrow. 

In one embodiment, the phenotype (or phenotypic change) associated with a disruption in 
30 the target gene is placed into or stored in a database. Preferably, the database includes: (i) 
genotypic data (e.g., identification of the disrupted gene) and (ii) phenotypic data (e.g., 
phenotype(s) resulting from the gene disruption) associated with the genotypic data. The 
database is preferably electronic. In addition, the database is preferably combined with a search 
tool so that the database is searchable. 
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5 The present invention further contemplates conditional knockout animals, such as those 

produced using recombination methods. Bacteriophage PI Cre recombinase and flp recombinase 
from yeast plasmids are two non-limiting examples of site-specific DNA recombinase enzymes 
which cleave DNA at specific target sites (lox P sites for cre recombinase and frt sites for flp 
recombinase) and catalyze a ligation of this DNA to a second cleaved site. A large number of 
10 suitable alternative site-specific recombinases have been described, and their genes can be used 
in accordance with the method of the present invention. Such recombinases include the Int 
recombinase of bacteriophage X (with or without Xis) (Weisberg, R. et. al., in Lambda II, 
(Hendrix, R., et al, Eds.), Cold Spring Harbor Press, Cold Spring Harbor, NY, pp. 21 1-50 
(1983), herein incorporated by reference); Tpnl and the 3-lactamase transposons (Mercier, et al, 
lfo 7. Bacterial, 172:3745-57 (1990)); the Tn3 resolvase (Flanagan & Fennewald 7 Molec. Biol, 
;|J 206:295-304 (1989); Stark, et al, Cell 58:779-90 (1989)); the yeast recombinases (Matsuzaki, et 
!--** al, 7. Bacteriol, 172:610-18 (1990)); the B. subtilis SpoIVC recombinase (Sato, et al, J. 
|7i Bacteriol 172: 1092-98 (1990;); the Flp recombinase (Schwartz & Sadowski, J. Molec.Biol, 
205:647-658 (1989); Parsons, et al, 7 Biol Chem., 265:4527-33 (1990); Golic & Lindquist, 
2Q Cell, 59:499-509 (1989); Amin, et al, 7. Molec. Biol, 214:55-72 (1990)); the Hin recombinase 
H | (Glasgow, et al, 7. Biol Chem., 264:10072-82 (1989)); immunoglobulin recombinases (Malynn, 
H; et al, Cell 54:453-460 (1988)); and the Cin recombinase (Haffter & Bickle, EMBO 7, 
Jj 7:3991-3996 (1988); Hubner, et al, 7. Molec. Biol, 205:493-500 (1989)), all herein incorporated 
|5S8 by reference. Such systems are discussed by Echols (7 Biol Chem. 265:14697-14700 (1990)); 
25 de Villartay {Nature, 335:170-74 (1988)); Craig, {Ann. Rev, Genet, 22:77-105 (1988)); 
Poyart-Salmeron, etal, {EMBO J. 8:2425-33 (1989)); Hunger-Bertling, et al {Mol Cell 
Biochem., 92:107-16 (1990)); and Cregg & Madden {Mol Gen. Genet, 219:320-23 (1989)), all 
herein incorporated by reference. 

Cre has been purified to homogeneity, and its reaction with the loxP site has been 
30 extensively characterized (Abremski & Hess 7 Mol Biol 259: 1509-14 (1984), herein 

incorporated by reference). Cre protein has a molecular weight of 35,000 and can be obtained 
commercially from New England Nuclear/Du Pont. The cre gene (which encodes the Cre 
protein) has been cloned and expressed (Abremski, et al Cell 32:1301-11 (1983), herein 
incorporated by reference). The Cre protein mediates recombination between two loxP 
35 sequences (Sternberg, et al Cold Spring Harbor Symp. Quant Biol 45:297-309 (1981)), which 
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5 may be present on the same or different DNA molecule. Because the internal spacer sequence of 
the loxP site is asymmetrical, two loxP sites can exhibit directionality relative to one another 
(Hoess & Abremski Proc. Natl Acad. ScL U.S.A. 81:1026-29 (1984;). Thus, when two sites on 
the same DNA molecule are in a directly repeated orientation, Cre will excise the DNA between 
the sites (Abremski, et al Cell 32:1301-11 (1983)). However, if the sites are inverted with 

10 respect to each other, the DNA between them is not excised after recombination but is simply 
inverted. Thus, a circular DNA molecule having two loxP sites in direct orientation will 
recombine to produce two smaller circles, whereas circular molecules having two loxP sites in an 
inverted orientation simply invert the DNA sequences flanked by the loxP sites. In addition, 
recombinase action can result in reciprocal exchange of regions distal to the target site when 

1§. targets are present on separate DNA molecules. 

4) Recombinases have important application for characterizing gene function in knockout 

;". s "l models. When the constructs described herein are used to disrupt target genes, a fusion 
^ transcript can be produced when insertion of the positive selection marker occurs downstream 
i3* (3) of the translation initiation site of the target gene. The fusion transcript could result in some 
2D level of protein expression with unknown consequence. It has been suggested that insertion of a 
positive selection marker gene can affect the expression of nearby genes. These effects may 
j«b make it difficult to determine gene function after a knockout event since one could not discern 
j«; whether a given phenotype is associated with the inactivation of a gene, or the transcription of 
l SBfe nearby genes. Both potential problems are solved by exploiting recombinase activity. When the 
25 positive selection marker is flanked by recombinase sites in the same orientation, the addition of 
the corresponding recombinase will result in the removal of the positive selection marker. In this 
way, effects caused by the positive selection marker or expression of fusion transcripts are 
avoided. 

In one embodiment, purified recombinase enzyme is provided to the cell by direct 
30 microinjection. In another embodiment, recombinase is expressed from a co-transfected 

construct or vector in which the recombinase gene is operably linked to a functional promoter. 
An additional aspect of this embodiment is the use of tissue-specific or inducible recombinase 
constructs which allow the choice of when and where recombination occurs. One method for 
practicing the inducible forms of recombinase-mediated recombination involves the use of 
35 vectors that use inducible or tissue-specific promoters or other gene regulatory elements to 
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5 express the desired recombinase activity. The inducible expression elements are preferably 
operatively positioned to allow the inducible control or activation of expression of the desired 
recombinase activity. Examples of such inducible promoters or other gene regulatory elements 
include, but are not limited to, tetracycline, metallothionine, ecdysone, and other steroid- 
responsive promoters, rapamycin responsive promoters, and the like (No, et al Proc. Natl Acad. 
10 Sci. USA, 93:3346-51 (1996); Furth, etal Proc. Natl Acad. Scl USA, 91:9302-6 (1994)). 

Additional control elements that can be used include promoters requiring specific transcription 
factors such as viral, promoters. Vectors incorporating such promoters would only express 
recombinase activity in cells that express the necessary transcription factors. 

Other methods known in the art may be used to produce the transgenic cells and 

IS i knockout mice of the present invention. For example, the methods described in U.S. Patent No. 

;jj 5,464,764; U.S. Patent No. 5,487,992; U.S. Patent No. 5,627,059; and U.S. Patent No. 5,631,153 
may be used to produce a transgenic cell or knockout mice comprising a disruption in a gene 

j . | encoding a retina-specific nuclear receptor as provided by the present invention. 

2G Models for Disease 

The cell- and animal-based systems described herein can be utilized as models for 
^ diseases. Animals of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, 
Cl pigs, micro-pigs, goats, and non-human primates, e.g., baboons, monkeys, and chimpanzees may 
be used to generate disease animal models. In addition, cells from humans may be used. These 
25 systems may be used in a variety of applications. For example, the cell- and animal-based model 
systems may be used to further characterize retina-specific nuclear receptor genes. Such assays 
may be utilized as part of screening strategies designed to identify compounds which are capable 
of ameliorating disease symptoms. Thus, the animal- and cell-based models may be used to 
identify drugs, pharmaceuticals, therapies and interventions which may be effective in treating 
30 disease. 

Cell-based systems may be used to identify compounds which may act to ameliorate 
disease symptoms. For example, such cell systems may be exposed to a compound suspected of 
exhibiting an ability to ameliorate disease symptoms, at a sufficient concentration and for a time 
sufficient to elicit such an amelioration of disease symptoms in the exposed cells. After 
35 exposure, the cells are examined to determine whether one or more of the disease cellular 
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5 phenotypes has been altered to resemble a more normal or more wild type, non-disease 
phenotype. 

In addition, animal-based disease systems, such as those described herein, may be used to 
identify compounds capable of ameliorating disease symptoms. Such animal models may be 
used as test substrates for the identification of drugs, pharmaceuticals, therapies, and 
10 interventions which may be effective in treating a disease or other phenotypic characteristic of 
the animal. For example, animal models may be exposed to a compound or agent suspected of 
exhibiting an ability to ameliorate disease symptoms, at a sufficient concentration and for a time 
sufficient to elicit such an amelioration of disease symptoms in the exposed animals. The 
response of the animals to the exposure may be monitored by assessing the reversal of disorders 
ljSL associated with the disease. Exposure may involve treating mother animals during gestation of 
! 4] the model animals described herein, thereby exposing embryos or fetuses to the compound or 

agent which may prevent or ameliorate the disease or phenotype. Neonatal, juvenile, and adult 
; ; = animals can also be exposed. 

More particularly, using the animal models of the invention, specifically, knockout mice, 
2Q methods of identifying compounds are provided, preferably, on the basis of the ability of the 
compounds to affect physiological, histological or behavioral phenotypes associated with a 
M disruption in a gene that encodes a retina-specific nuclear receptor. 

■fj In one embodiment, the present invention provides a method of identifying agents having 

jS!!S an effect on retina-specific nuclear receptor expression or function. The method includes 

25 administering an effective amount of the agent to a vertebrate animal, preferably a mouse, having 
a disruption in a gene encoding a retina-specific nuclear receptor. The method includes 
measuring a physiological response of the animal, for example, to the agent, and comparing the 
physiological response of such animal to a control animal, wherein the physiological response of 
the animal comprising a gene encoding a retina-specific nuclear receptor as compared to the 

30 control animal indicates the specificity of the agent. A "physiological response" is any 

biological or physical parameter of an animal which can be measured. Molecular assays (e.g., 
gene transcription, protein production and degradation rates), physical parameters (e.g., exercise 
physiology tests, measurement of various parameters of respiration, measurement of heart rate or 
blood pressure, measurement of bleeding time, aPTT.T, or TT), and cellular assays (e.g,. 
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5 immunohistochemical assays of cell surface markers, or the ability of cells to aggregate or 
proliferate) can be used to assess a physiological response. 

The animals and cells of the present invention may by utilized as models for diseases, 
disorders, or conditions associated with phenotypes relating to a disruption in a gene encoding a 
retina-specific nuclear receptor. 
10 The present invention also provides a unique animal model for testing and developing 

new treatments relating to the behavioral phenotypes. Analysis of the behavioral phenotype 
allows for the development of an animal model useful for testing, for instance, the efficacy of 
proposed genetic and pharmacological therapies for human genetic diseases, such as 
neurological, neuropsychological, or psychotic illnesses. 
15, : A statistical analysis of the various behaviors measured can be carried out using any 

'4} conventional statistical program routinely used by those skilled in the art (such as, for example, 
"Analysis of Variance" or ANOVA). A "p" value of about 0.05 or less is generally considered 
rj to be statistically significant, although slightly higher p values may still be indicative of 
; " statistically significant differences. To statistically analyze abnormal behavior, a comparison is 
2Q made between the behavior of a transgenic animal (or a group thereof) to the behavior of a wild- 
j s ; = type mouse (or a group thereof), typically under certain prescribed conditions. "Abnormal 
M; behavior" as used herein refers to behavior exhibited by an animal having a disruption in the 
q target gene, e.g. transgenic animal, which differs from an animal without a disruption in the 
r " target gene, e.g. wild-type mouse. Abnormal behavior consists of any number of standard 
25 behaviors that can be objectively measured (or observed) and compared. In the case of 
comparison, it is preferred that the change be statistically significant to confirm that there is 
indeed a meaningful behavioral difference between the knockout animal and the wild-type 
control animal. Examples of behaviors which may be measured or observed include, but are not 
limited to, ataxia, rapid limb movement, eye movement, breathing, motor activity, cognition, 
30 emotional behaviors, social behaviors, hyperactivity, hypersensitivity, anxiety, impaired 
learning, abnormal reward behavior, and abnormal social interaction, such as aggression 

A series of tests may be used to measure the behavioral phenotype of the animal models 
of the present invention, including neurological and neuropsychological tests to identify 
abnormal behavior. These tests may be used to measure abnormal behavior relating to, for 
35 example, learning and memory, eating, pain, aggression, sexual reproduction, anxiety, 
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5 depression, schizophrenia, and drug abuse. (See, e.g., Crawley and Paylor, Hormones and 
Behavior 31:197-211 (1997)), 

The social interaction test involves exposing a mouse to other animals in a variety of 
settings. The social behaviors of the animals (e.g., touching, climbing, sniffing, and mating) are 
subsequently evaluated. Differences in behaviors can then be statistically analyzed and 
10 compared (See, e.g., S. E. File, et al, Pharmacol Bioch. Behav. 22:941-944 (1985); R. R. 
Holson, Phys. Behav. 37:239-247 (1986)). Examplary behavioral tests include the following. 

The mouse startle response test typically involves exposing the animal to a sensory 
(typically auditory) stimulus and measuring the startle response of the animal (see, e.g., M. A. 
Geyer, et al, Brain Res, Bull 25:485-498 (1990); Paylor and Crawley, Psychopharmacology 
15 a 132:169-180 (1997)). A pre-pulse inhibition test can also be used, in which the percent 
4) inhibition (from a normal startle response) is measured by "cueing" the animal first with a brief 
j^L low-intensity pre-pulse prior to the startle pulse. 

f 5 '! The electric shock test generally involves exposure to an electrified surface and 

ip measurement of subsequent behaviors such as, for example, motor activity, learning, social 
20 behaviors. The behaviors are measured and statistically analyzed using standard statistical tests. 
^ (See, e.g., G. J. Kant, et al, Pharm. Bioch. Behav. 20:793-797 (1984); N. J. Leidenheimer, et al., 
M Pharmacol. Bioch. Behav. 30:351-355 (1988)). 

The tail-pinch or immobilization test involves applying pressure to the tail of the animal 
j?s ' p and/or restraining the animal's movements. Motor activity, social behavior, and cognitive 
25 behavior are examples of the areas that are measured. (See, e.g., M. Bertolucci D'Angic, et ah, 
Neurochem. 55:1208-1214 (1990)). 

The novelty test generally comprises exposure to a novel environment and/or novel 
objects. The animal's motor behavior in the novel environment and/or around the novel object 
are measured and statistically analyzed. (See, e.g., D. K. Reinstein, et al, Pharm. Bioch. Behav. 
30 17:193-202 (1982); B. Poucet, Behav. NeuroscL 103:1009-10016 (1989); R. R, Holson, et al, 
Phys. Behav. 37:231-238 (1986)). This test may be used to detect visual processing deficiencies 
or defects. 

The learned helplessness test involves exposure to stresses, for example, noxious stimuli, 
which cannot be affected by the animal's behavior. The animal's behavior can be statistically 
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5 analyzed using various standard statistical tests. (See, e.g., A. Leshner, et aL, Behav. Neural Biol 
26:497-501 (1979)). 

Alternatively, a tail suspension test may be used, in which the "immobile" time of the 
mouse is measured when suspended "upside-down" by its tail. This is a measure of whether the 
animal struggles, an indicator of depression. In humans, depression is believed to result from 

10 feelings of a lack of control over one's life or situation. It is believed that a depressive state can 
be elicited in animals by repeatedly subjecting them to aversive situations over which they have 
no control. A condition of "learned helplessness" is eventually reached, in which the animal will 
stop trying to change its circumstances and simply accept its fate. Animals that stop struggling 
sooner are believed to be more prone to depression. Studies have shown that the administration 

i t 5- of certain antidepressant drugs prior to testing increases the amount of time that animals struggle 

h*l before giving up. 

yl The Morris water-maze test comprises learning spatial orientations in water and 

\[] subsequently measuring the animal's behaviors, such as, for example, by counting the number of 
uH incorrect choices. The behaviors measured are statistically analyzed using standard statistical 
2Q tests. (See, e.g., E. M. Spruijt, et ah, Brain Res. 527:192-197 (1990)). 

Alternatively, a Y-shaped maze may be used (see, e.g., McFarland, D.J., Pharmacology, 
W* Biochemistry and Behavior 32:723-726 (1989); Dellu, F., et al., Neurobiology of Learning and 
j«l Memory 73:31-48 (2000)). The Y-maze is generally believed to be a test of cognitive ability. 
The dimensions of each arm of the Y-maze can be, for example, approximately 40 cm x 8 cm x 
25 20 cm, although other dimensions may be used. Each arm can also have, for example, sixteen 
equally spaced photobeams to automatically detect movement within the arms. At least two 
different tests can be performed using such a Y-maze. In a continuous Y-maze paradigm, mice 
are allowed to explore all three arms of a Y-maze for, e.g., approximately 10 minutes. The 
animals are continuously tracked using photobeam detection grids, and the data can be used to 
30 measure spontaneous alteration and positive bias behavior. Spontaneous alteration refers to the 
natural tendency of a "normal" animal to visit the least familiar arm of a maze. An alternation is 
scored when the animal makes two consecutive turns in the same direction, thus representing a 
sequence of visits to the least recently entered arm of the maze. Position bias determines 
egocentrically defined responses by measuring the animal's tendency to favor turning in one 
35 direction over another. Therefore, the test can detect differences in an animal's ability to 
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5 navigate on the basis of allocentric or egocentric mechanisms. The two-trial Y-maze memory 
test measures response to novelty and spatial memory based on a free-choice exploration 
paradigm. During the first trial (acquisition), the animals are allowed to freely visit two arms of 
the Y-maze for, e.g., approximately 15 minutes. The third arm is blocked off during this trial. 
The second trial (retrieval) is performed after an intertrial interval of, e.g., approximately 2 

10 hours. During the retrieval trial, the blocked arm is opened and the animal is allowed access to 
all three arms for, e.g., approximately 5 minutes. Data are collected during the retrieval trial and 
analyzed for the number and duration of visits to each arm. Because the three arms of the maze 
are virtually identical, discrimination between novelty and familiarity is dependent on 
"environmental" spatial cues around the room relative to the position of each arm. Changes in 

15 arm entry and duration of time spent in the novel arm in a transgenic animal model may be 

y % indicative of a role of that gene in mediating novelty and recognition processes. 
W The passive avoidance or shuttle box test generally involves exposure to two or more 

environments, one of which is noxious, providing a choice to be learned by the animal 
Behavioral measures include, for example, response latency, number of correct responses, and 

2CT consistency of response. (See, e.g., R. Ader, et ah, Psychon. Set 26:125-128 (1972); R. R. 
□ Holson, Phys. Behav. 37:221-230 (1986)). Alternatively, a zero-maze can be used. In a zero- 

maze, the animals can, for example, be placed in a closed quadrant of an elevated annular 
3] platform having, e.g., 2 open and 2 closed quadrants, and are allowed to explore for 

approximately 5 minutes. This paradigm exploits an approach-avoidance conflict between 

25 normal exploratory activity and an aversion to open spaces in rodents. This test measures 

anxiety levels and can be used to evaluate the effectiveness of anti-anxiolytic drugs. The time 
spent in open quadrants versus closed quadrants may be recorded automatically, with, for 
example, the placement of photobeams at each transition site. 

The food avoidance test involves exposure to novel food and objectively measuring, for 

30 example, food intake and intake latency. The behaviors measured are statistically analyzed using 
standard statistical tests. (See, e.g., B. A. Campbell, et al., 7. Comp. Physiol Psychol. 67:15-22 
(1969)). 

The elevated plus-maze test comprises exposure to a maze, without sides, on a platform, 
the animal's behavior is objectively measured by counting the number of maze entries and maze 
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5 learning. The behavior is statistically analyzed using standard statistical tests. (See, e.g., H. A. 
Baldwin, et al, Brain Res. Bull, 20:603-606 (1988)). 

The stimulant-induced hyperactivity test involves injection of stimulant drugs (e.g., 
amphetamines, cocaine, PCP, and the like), and objectively measuring, for example, motor 
activity, social interactions, cognitive behavior. The animal's behaviors are statistically analyzed 
10 using standard statistical tests. (See, e.g., P. B. S. Clarke, et al., Psychopharmacology 96:511-520 
(1988); P. Kuczenski, et al, J. Neuroscience 11:2703-2712 (1991)). 

The self-stimulation test generally comprises providing the mouse with the opportunity to 
regulate electrical and/or chemical stimuli to its own brain. Behavior is measured by frequency 
and pattern of self-stimulation. Such behaviors are statistically analyzed using standard statistical 
15 tests. (See, e.g., S. Nassif, et al, Brain Res., 332:247-257 (1985); W. L. Isaac, et al, Behav. 
% Neurosci. 103:345-355 (1989)). 

;* s f The reward test involves shaping a variety of behaviors, e.g., motor, cognitive, and social, 

M measuring, for example, rapidity and reliability of behavioral change, and statistically analyzing 
ffl the behaviors measured. (See, e.g., L. E. Jarrard, et al., Exp. Brain Res. 61:519-530 (1986)). 

2(T The DRL (differential reinforcement to low rates of responding) performance test 

□ involves exposure to intermittent reward paradigms and measuring the number of proper 
r l responses, e.g., lever pressing. Such behavior is statistically analyzed using standard statistical 
;2J tests. (See, e.g., J. D. Sinden, et al., Behav. Neurosci. 100:320-329 (1986); V. Nalwa, et al., 
U Behav Brain Res. 17:73-76 (1985); and A. J. Nonneman, et al., J. Comp. Physiol. Psych. 95:588- 

25 602 (1981)). 

The spatial learning test involves exposure to a complex novel environment, measuring 
the rapidity and extent of spatial learning, and statistically analyzing the behaviors measured, 
(See, e.g., N. Pitsikas, et al., Pharm. Bioch. Behav. 38:931-934 (1991); B. poucet, et al., Brain 
Res. 37:269-280 (1990); D. Christie, et al., Brain Res. 37:263-268 (1990); and F. Van Haaren, et 

30 al., Behav. Neurosci. 102:481-488 (1988)). Alternatively, an open-field (of) test may be used, in 
which the greater distance traveled for a given amount of time is a measure of the activity level 
and anxiety of the animal. When the open field is a novel environment, it is believed that an 
approach-avoidance situation is created, in which the animal is "torn" between the drive to 
explore and the drive to protect itself. Because the chamber is lighted and has no places to hide 

35 other than the corners, it is expected that a "normal" mouse will spend more time in the corners 
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5 and around the periphery than it will in the center where there is no place to hide. "Normal" 
mice will, however, venture into the central regions as they explore more and more of the 
chamber. It can then be extrapolated that especially anxious mice will spend most of their time 
in the corners, with relatively little or no exploration of the central region, whereas bold (i.e., less 
anxious) mice will travel a greater distance, showing little preference for the periphery versus the 

10 central region. 

The visual, somatosensory and auditory neglect tests generally comprise exposure to a 
sensory stimulus, objectively measuring, for example, orientating responses, and statistically 
analyzing the behaviors measured. (See, e.g., J. M. Vargo, et al., Exp, Neurol 102:199-209 
(1988)). 

l£l The consummatory behavior test generally comprises feeding and drinking, and 

objectively measuring quantity of consumption. The behavior measured is statistically analyzed 
K using standard statistical tests. (See, e.g., P. J. Fletcher, et al, Psychopharmacol 102:301-308 
\ 4 (1990); M. G. Corda, et al.„ Proc. Natl Acad. Sci. USA 80:2072-2076 (1983)). 

A visual discrimination test can also be used to evaluate the visual processing of an 
20 animal. One or two similar objects are placed in an open field and the animal is allowed to 

;„! 

l7i explore for about 5-10 minutes. The time spent exploring each object (proximity to, i.e., 
r ^ movement within, e.g., about 3-5 cm of the object is considered exploration of an object) is 
Q recorded. The animal is then removed from the open field, and the objects are replaced by a 
similar object and a novel object. The animal is returned to the open field and the percent time 
25 spent exploring the novel object over the old object is measured (again, over about a 5-10 minute 
span). "Normal" animals will typically spend a higher percentage of time exploring the novel 
object rather than the old object. If a delay is imposed between sampling and testing, the 
memory task becomes more hippocampal-dependent. If no delay is imposed, the task is more 
based on simple visual discrimination. This test can also be used for olfactory discrimination, in 
30 which the objects (preferably, simple blocks) can be sprayed or otherwise treated to hold an 
odor. This test can also be used to determine if the animal can make gustatory discriminations; 
animals that return to the previously eaten food instead of novel food exhibit gustatory 
neophobia. 

A hot plate analgesia test can be used to evaluate an animal's sensitivity to heat or painful 
35 stimuli. For example, a mouse can be placed on an approximately 55°C hot plate and the 
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5 mouse's response latency (e.g., time to pick up and lick a hind paw) can be recorded. These 
responses are not reflexes, but rather "higher" responses requiring cortical involvement. This 
test may be used to evaluate a nociceptive disorder. 

An accelerating rotarod test may be used to measure coordination and balance in mice. 
Animals can be, for example, placed on a rod that acts like a rotating treadmill (or rolling log). 
10 The rotarod can be made to rotate slowly at first and then progressively faster until it reaches a 
speed of, e.g., approximately 60 rpm. The mice must continually reposition themselves in order 
to avoid falling off. The animals are preferably tested in at least three trials, a minimum of 20 
minutes apart. Those mice that are able to stay on the rod the longest are believed to have better 
coordination and balance. 
15 a A metrazol administration test can be used to screen animals for varying susceptibilities 

; ii to seizures or similar events. For example, a 5mg/ml solution of metrazol can be infused through 
u the tail vein of a mouse at a rate of, e.g., approximately 0.375 ml/min. The infusion will cause 
H all mice to experience seizures, followed by death. Those mice that enter the seizure stage the 
01 soonest are believed to be more prone to seizures. Four distinct physiological stages can be 
2Q recorded: soon after the start of infusion, the mice will exhibit a noticeable "twitch", followed by 
;^ a series of seizures, ending in a final tensing of the body known as "tonic extension", which is 
h * followed by death. 

'/■ I ; i 
: j! s 

Hz? 

h " Target Gene Products 

25 The present invention further contemplates use of the target gene sequence to produce 

target gene products. Target gene products may include proteins that represent functionally 
equivalent gene products. Such an equivalent gene product may contain deletions, additions or 
substitutions of amino acid residues within the amino acid sequence encoded by the gene 
sequences described herein, but which result in a silent change, thus producing a functionally 

30 equivalent target gene product. Amino acid substitutions may be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of 
the residues involved. 

For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 
valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include 

35 glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged 
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5 (basic) amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino 
acids include aspartic acid and glutamic acid. "Functionally equivalent", as utilized herein, refers 
to a protein capable of exhibiting a substantially similar in vivo activity as the endogenous gene 
products encoded by the target gene sequences. Alternatively, when utilized as part of an assay, 
"functionally equivalent" may refer to peptides capable of interacting with other cellular or 

10 extracellular molecules in a manner substantially similar to the way in which the corresponding 
portion of the endogenous gene product would. 

Other protein products useful according to the methods of the invention are peptides 
derived from or based on the target gene produced by recombinant or synthetic means (derived 
peptides). 

15 . Target gene products may be produced by recombinant DNA technology using 

■J J techniques well known in the art. Thus, methods for preparing the gene polypeptides and 
'{Z peptides of the invention by expressing nucleic acid encoding gene sequences are described 
; ^ herein. Methods which are well known to those skilled in the art can be used to construct 
fl! expression vectors containing gene protein coding sequences and appropriate 
20 transcriptional/translational control signals. These methods include, for example, in vitro 
W recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic 
l«b recombination (see, e.g., Sambrook, et al., 1989, supra, and Ausubel, et ah, 1989, supra). 
%l Alternatively, RNA capable of encoding gene protein sequences may be chemically synthesized 
N : using, for example, automated synthesizers (see, e.g. Oligonucleotide Synthesis: A Practical 
25 Approach, Gait, M. J, ed., IRL Press, Oxford (1984)). 

A variety of host-expression vector systems may be utilized to express the gene coding 
sequences of the invention. Such host-expression systems represent vehicles by which the 
coding sequences of interest may be produced and subsequently purified, but also represent cells 
which may, when transformed or transfected with the appropriate nucleotide coding sequences, 
30 exhibit the gene protein of the invention in situ. These include but are not limited to 

microorganisms such as bacteria (e.g., E. coli, B. subtilis) transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing gene protein 
coding sequences; yeast (e.g. Saccharomyces, Pichia) transformed with recombinant yeast 
expression vectors containing the gene protein coding sequences; insect cell systems infected 
35 with recombinant virus expression vectors (e.g., baculovirus) containing the gene protein coding 
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5 sequences; plant cell systems infected with recombinant virus expression vectors (e.g., 

cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors (e.g., Ti plasmid) containing gene protein coding sequences; or 
mammalian cell systems (e.g. COS, CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing promoters derived from the genome of mammalian cells (e.g., 
10 metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7.5 K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the gene protein being expressed. For example, when a 
large quantity of such a protein is to be produced, for the generation of antibodies or to screen 
15 peptide libraries, for example, vectors which direct the expression of high levels of fusion protein 

products that are readily purified may be desirable. Such vectors include, but are not limited, to 
iff the E. coli expression vector pUR278 (Ruther et al, EMBO J., 2:1791-94 (1983)), in which the 
^ gene protein coding sequence may be ligated individually into the vector in frame with the lac Z 
§\ coding region so that a fusion protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids 
20 " Res., 13:3101-09 (1985); Van Heeke et al, J. Biol Chem., 264:5503-9 (1989)); and the like. 
U pGEX vectors may also be used to express foreign polypeptides as fusion proteins with 
^ glutathione S-transferase (GST). In general, such fusion proteins are soluble and can easily be 
%l purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the 
i e! * presence of free glutathione. The pGEX vectors are designed to include thrombin or factor Xa 
25 protease cleavage sites so that the cloned target gene protein can be released from the GST 
moiety. 

In a preferred embodiment, full length cDNA sequences are appended with in-frame Bam 
HI sites at the amino terminus and Eco RI sites at the carboxyl terminus using standard PCR 
methodologies (Innis, et al. (eds) PCR Protocols: A Guide to Methods and Applications, 
30 Academic Press, San Diego (1990)) and ligated into the pGEX-2TK vector (Pharmacia, Uppsala, 
Sweden). The resulting cDNA construct contains a kinase recognition site at the amino terminus 
for radioactive labeling and glutathione S-transferase sequences at the carboxyl terminus for 
affinity purification (Nilsson, et al, EMBO J., 4: 1075-80 (1985); Zabeau et al, EMBO J., 1: 
1217-24 (1982)). 
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5 In an insect system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used 

as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. The gene 
coding sequence may be cloned individually into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for example the 
polyhedrin promoter). Successful insertion of gene coding sequence will result in inactivation of 
10 the polyhedrin gene and production of non-occluded recombinant virus {i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are then used 
to infect Spodoptera frugiperda cells in which the inserted gene is expressed (see, e.g., Smith, et 
aU J. Virol 46: 584-93 (1983); U.S. Pat. No. 4,745,051). 

In mammalian host cells, a number of viral-based expression systems may be utilized. In 
15 cases where an adenovirus is used as an expression vector, the gene coding sequence of interest 
H n may be ligated to an adenovirus transcription/translation control complex, e.g., the late promoter 

and tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus 
H genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral 
§\ genome (e.g., region El or E3) will result in a recombinant virus that is viable and capable of 
20 expressing gene protein in infected hosts, (e.g., see Logan et al, Proc. Natl. Acad. Sci USA, 
Q 81:3655-59 (1984)). Specific initiation signals may also be required for efficient translation of 
inserted gene coding sequences. These signals include the ATG initiation codon and adjacent 
%l sequences. In cases where an entire gene, including its own initiation codon and adjacent 
M sequences, is inserted into the appropriate expression vector, no additional translational control 
25 signals may be needed. However, in cases where only a portion of the gene coding sequence is 
inserted, exogenous translational control signals, including, perhaps, the ATG initiation codon, 
must be provided. Furthermore, the initiation codon must be in phase with the reading frame of 
the desired coding sequence to ensure translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of a variety of origins, both natural and 
30 synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate 
transcription enhancer elements, transcription terminators, etc. (see Bitter, et al, Methods in 
EnzymoL, 153:516-44(1987)). 

In addition, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion desired. 
35 Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein products may 
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5 be important for the function of the protein. Different host cells have characteristic and specific 
mechanisms for the post-translational processing and modification of proteins. Appropriate cell 
lines or host systems can be chosen to ensure the correct modification and processing of the 
foreign protein expressed. To this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, glycosylation, and phosphorylation of 
10 the gene product may be used. Such mammalian host cells include but are not limited to CHO, 
VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, etc. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. For example, cell lines which stably express the gene protein may be engineered. 
Rather than using expression vectors which contain viral origins of replication, host cells can be 
15 transformed with DNA controlled by appropriate expression control elements (e.g., promoter, 

enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable 
01 marker. Following the introduction of the foreign DNA, engineered cells may be allowed to 

grow for 1-2 days in an enriched media, and then are switched to a selective media. The 
^- selectable marker in the recombinant plasmid confers resistance to the selection and allows cells 
2<Sf b which stably integrate the plasmid into their chromosomes and grow, to form foci which in turn 
fll can be cloned and expanded into cell lines. This method may advantageously be used to 
^; engineer cell lines which express the gene protein. Such engineered cell lines may be 

particularly useful in screening and evaluation of compounds that affect the endogenous activity 
U-i of the gene protein. 

25 In a preferred embodiment, control of timing and/or quantity of expression of the 

recombinant protein can be controlled using an inducible expression construct. Inducible 
constructs and systems for inducible expression of recombinant proteins will be well known to 
those skilled in the art. Examples of such inducible promoters or other gene regulatory elements 
include, but are not limited to, tetracycline, metallothionine, ecdysone, and other steroid- 

30 responsive promoters, rapamycin responsive promoters, and the like (No, et al, Proc. Natl 

Acad. Sci. USA, 93:3346-51 (1996); Furth, etal, Proc. Natl Acad. Sci. USA, 91:9302-6 (1994)). 
Additional control elements that can be used include promoters requiring specific transcription 
factors such as viral, particularly HIV, promoters. In one in embodiment, a Tet inducible gene 
expression system is utilized. (Gossen et al, Proc. Natl Acad. Sci USA, 89:5547-51 (1992); 

35 Gossen, et al, Science, 268: 1766-69 (1995)). Tet Expression Systems are based on two 



5 regulatory elements derived from the tetracycline-resistance operon of the E. coli TnlO 

transposon — the tetracycline repressor protein (TetR) and the tetracycline operator sequence 
(tetO) to which TetR binds. Using such a system, expression of the recombinant protein is placed 
under the control of the tetO operator sequence and transfected or transformed into a host celL In 
the presence of TetR, which is co-transfected into the host cell, expression of the recombinant 
10 protein is repressed due to binding of the TetR protein to the tetO regulatory element. High- 
level, regulated gene expression can then be induced in response to varying concentrations of 
tetracycline (Tc) or Tc derivatives such as doxycycline (Dox), which compete with tetO elements 
for binding to TetR. Constructs and materials for tet inducible gene expression are available 
commercially from CLONTECH Laboratories, Inc., Palo Alto, CA. 
15 When used as a component in an assay system, the gene protein may be labeled, either 

iil directly or indirectly, to facilitate detection of a complex formed between the gene protein and a 
T[ test substance. Any of a variety of suitable labeling systems may be used including but not 
| wfe s limited to radioisotopes such as 1251; enzyme labeling systems that generate a detectable 
!-fi calorimetric signal or light when exposed to substrate; and fluorescent labels. Where 
26 recombinant DNA technology is used to produce the gene protein for such assay systems, it may 
P be advantageous to engineer fusion proteins that can facilitate labeling, immobilization and/or 
\ n b detection. 

% Indirect labeling involves the use of a protein, such as a labeled antibody, which 

M specifically binds to the gene product. Such antibodies include but are not limited to polyclonal, 
25 monoclonal, chimeric, single chain, Fab fragments and fragments produced by a Fab expression 
library. 

Production of Antibodies 

Described herein are methods for the production of antibodies capable of specifically 

30 recognizing one or more epitopes. Such antibodies may include, but are not limited to 

polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single 
chain antibodies, Fab fragments, F(ab')2 fragments, fragments produced by a Fab expression 
library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 
Such antibodies may be used, for example, in the detection of a target gene in a biological 

35 sample, or, alternatively, as a method for the inhibition of abnormal target gene activity. Thus, 
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5 such antibodies may be utilized as part of disease treatment methods, and/or may be used as part 
of diagnostic techniques whereby patients may be tested for abnormal levels of target gene 
proteins, or for the presence of abnormal forms of the such proteins. 

For the production of antibodies, various host animals may be immunized by injection 
with the target gene, its expression product or a portion thereof. Such host animals may include 

10 but are not limited to rabbits, mice, and rats, to name but a few. Various adjuvants may be used 
to increase the immunological response, depending on the host species, including but not limited 
to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille 

15 Calmette-Gueriri) and Corynebacterium parvum. 

\$ Polyclonal antibodies are heterogeneous populations of antibody molecules derived from 

the sera of animals immunized with an antigen, such as target gene product, or an antigenic 
functional derivative thereof. For the production of polyclonal antibodies, host animals such as 
ill those described above, may be immunized by injection with gene product supplemented with 
26 adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a particular 
W*. antigen, may be obtained by any technique which provides for the production of antibody 
!5| molecules by continuous cell lines in culture. These include, but are not limited to the 
M hybridoma technique of Kohler and Milstein, Nature, 256:495-7 (1975); and U.S. Pat. No. 
25 4,376,1 10), the human B-cell hybridoma technique (Kosbor, et al, Immunology Today, 4:72 
(1983); Cote, et al, Proc. Natl Acad Sci USA, 80:2026-30 (1983)), and the EBV-hybridoma 
technique (Cole, et al, in Monoclonal Antibodies And Cancer Therapy, Alan R. Liss, Inc., New 
York, pp. 77-96 (1985)). Such antibodies may be of any immunoglobulin class including IgG, 
IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing the mAb of this 
30 invention may be cultivated in vitro or in vivo. Production of high titers of mAbs in vivo makes 
this the presently preferred method of production. 

In addition, techniques developed for the production of "chimeric antibodies" (Morrison, 
et al, Proa Natl Acad. Scl, 81:6851-6855 (1984); Takeda, et al, Nature, 314:452-54 (1985)) 
by splicing the genes from a mouse antibody molecule of appropriate antigen specificity together 
35 with genes from a human antibody molecule of appropriate biological activity can be used. A 
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5 chimeric antibody is a molecule in which different portions are derived from different animal 
species, such as those having a variable region derived from a murine mAb and a human 
immunoglobulin constant region. 

Alternatively, techniques described for the production of single chain antibodies (U.S. 
Pat. No. 4,946,778; Bird, Science 242:423-26 (1988); Huston, et al, Proc. Natl Acad. ScL USA, 
10 85:5879-83 (1988); and Ward, et ah, Nature, 334:544-46 (1989)) can be adapted to produce 
gene-single chain antibodies. Single chain antibodies are formed by linking the heavy and light 
chain fragments of the Fv region via an amino acid bridge, resulting in a single chain 
polypeptide. 

Antibody fragments which recognize specific epitopes may be generated by known 
15 techniques. For example, such fragments include but are not limited to: the F(ab')2 fragments 
•j| which can be produced by pepsin digestion of the antibody molecule and the Fab fragments 

which can be generated by reducing the disulfide bridges of the F(ab02 fragments. 
X\ Alternatively, Fab expression libraries may be constructed (Huse, et al, Science, 246: 1275-81 
□! (1989)) to allow rapid and easy identification of monoclonal Fab fragments with the desired 

j-.L 

20 specificity. 

Screening for Therapeutic Agents 
% Cells that contain and express target gene sequences may be used to screen for 

N : therapeutic agents. Such cells may include non-recombinant monocyte cell lines, such as U937 

25 (ATCC# CRL-1593), THP-1 (ATCC# TIB-202), and P388D1 (ATCC# TIB-63); endothelial 
cells such as HUVEC's and bovine aortic endothelial cells (BAEC's); as well as generic 
mammalian cell lines such as HeLa cells and COS cells, e.g., COS-7 (ATCC# CRL-1651). 
Further, such cells may include recombinant, transgenic cell lines. For example, the knockout 
mice of the invention may be used to generate cell lines, containing one or more cell types 

30 involved in a disease, that can be used as cell culture models for that disorder. While cells, 

tissues, and primary cultures derived from the disease transgenic animals of the invention may be 
utilized, the generation of continuous cell lines is preferred. For examples of techniques which 
may be used to derive a continuous cell line from the transgenic animals, see Small, et ah, Mol 
Cell Biol, 5:642-48 (1985). 
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5 Target gene sequences may be introduced into, and overexpressed in, the genome of the 

cell of interest. In order to overexpress a target gene sequence, the coding portion of the target 
gene sequence may be ligated to a regulatory sequence which is capable of driving gene 
expression in the cell type of interest. Such regulatory regions will be well known to those of 
skill in the art, and may be utilized in the absence of undue experimentation. Target gene 
10 sequences may also be disrupted or underexpressed. Cells having target gene disruptions or 
underexpressed target gene sequences may be used, for example, to screen for agents capable of 
affecting alternative pathways which compensate for any loss of function attributable to the 
disruption or underexpression. 

In vitro systems may be designed to identify compounds capable of binding the target 
15 gene products. Such compounds may include, but are not limited to, peptides made of D-and/or 
L-configuration amino acids (in, for example, the form of random peptide libraries; see e.g., 
Lam, et al, Nature, 354:82-4 (1991)), phosphopeptides (in, for example, the form of random or 
H partially degenerate, directed phosphopeptide libraries; see, e.g., Songyang, et al, Cell, 72:767- 
78 (1993)), antibodies, and small organic or inorganic molecules. Compounds identified may be 
20 ' useful, for example, in modulating the activity of target gene proteins, preferably mutant target 
gene proteins; elaborating the biological function of the target gene protein; or screening for 
i«i compounds that disrupt normal target gene interactions or themselves disrupt such interactions. 
zl The principle of the assays used to identify compounds that bind to the target gene 

H protein involves preparing a reaction mixture of the target gene protein and the test compound 
25 under conditions and for a time sufficient to allow the two components to interact and bind, thus 
forming a complex which can be removed and/or detected in the reaction mixture. These assays 
can be conducted in a variety of ways. For example, one method to conduct such an assay would 
involve anchoring the target gene protein or the test substance onto a solid phase and detecting 
target protein/test substance complexes anchored on the solid phase at the end of the reaction. In 
30 one embodiment of such a method, the target gene protein may be anchored onto a solid surface, 
and the test compound, which is not anchored, may be labeled, either directly or indirectly. 

In practice, microtitre plates are conveniently utilized. The anchored component may be 
immobilized by non-covalent or covalent attachments. Non-covalent attachment may be 
accomplished simply by coating the solid surface with a solution of the protein and drying. 
35 Alternatively, an immobilized antibody, preferably a monoclonal antibody, specific for the 
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5 protein may be used to anchor the protein to the solid surface. The surfaces may be prepared in 
advance and stored. 

In order to conduct the assay, the nonimmobilized component is added to the coated 
surface containing the anchored component. After the reaction is complete, unreacted 
components are removed (e.g., by washing) under conditions such that any complexes formed 
10 will remain immobilized on the solid surface. The detection of complexes anchored on the solid 
surface can be accomplished in a number of ways. Where the previously nonimmobilized 
component is pre-labeled, the detection of label immobilized on the surface indicates that 
complexes were formed. Where the previously nonimmobilized component is not pre-labeled, 
an indirect label can be used to detect complexes anchored on the surface; e.g., using a labeled 
15 antibody specific for the previously nonimmobilized component (the antibody, in turn, may be 
iX\ directly labeled or indirectly labeled with a labeled anti-Ig antibody). 
*?[ Alternatively, a reaction can be conducted in a liquid phase, the reaction products 

M separated from unreacted components, and complexes detected; e.g., using an immobilized 
31 antibody specific for target gene product or the test compound to anchor any complexes formed 
20"" in solution, and a labeled antibody specific for the other component of the possible complex to 
O detect anchored complexes. 

u Compounds that are shown to bind to a particular target gene product through one of the 

% methods described above can be further tested for their ability to elicit a biochemical response 
U from the target gene protein. Agonists, antagonists and/or inhibitors of the expression product 
25 can be identified utilizing assays well known in the art. 

Antisense, Ribozymes, and Antibodies 

Other agents which may be used as therapeutics include the target gene, its expression 
product(s) and functional fragments thereof. Additionally, agents which reduce or inhibit mutant 
30 target gene activity may be used to ameliorate disease symptoms. Such agents include antisense, 
ribozyme, and triple helix molecules. Techniques for the production and use of such molecules 
are well known to those of skill in the art. 

Anti-sense RNA and DNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. With respect to antisense 
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5 DNA, oligodeoxyribonucleotides derived from the translation initiation site, e.g., between the - 
10 and +10 regions of the target gene nucleotide sequence of interest, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by an endonucleolytic cleavage. 

10 The composition of ribozyme molecules must include one or more sequences complementary to 
the target gene mRNA, and must include the well known catalytic sequence responsible for 
mRNA cleavage. For this sequence, see U.S. Pat. No. 5,093,246, which is incorporated by 
reference herein in its entirety. As such within the scope of the invention are engineered 
hammerhead motif ribozyme molecules that specifically and efficiently catalyze endonucleolytic 

15 cleavage of RNA sequences encoding target gene proteins. 

I] j Specific ribozyme cleavage sites within any potential RNA target are initially identified 

;; 5 f by scanning the molecule of interest for ribozyme cleavage sites which include the following 
sequences, GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may 
2<Sr be evaluated for predicted structural features, such as secondary structure, that may render the 
Q oligonucleotide sequence unsuitable. The suitability of candidate sequences may also be 
H evaluated by testing their accessibility to hybridization with complementary oligonucleotides, 
|;|! using ribonuclease protection assays. 

Nucleic acid molecules to be used in triple helix formation for the inhibition of 
25 transcription should be single stranded and composed of deoxyribonucleotides. The base 

composition of these oligonucleotides must be designed to promote triple helix formation via 
Hoogsteen base pairing rules, which generally require sizeable stretches of either purines or 
pyrimi dines to be present on one strand of a duplex. Nucleotide sequences may be pyrimidine- 
based, which will result in TAT and CGC triplets across the three associated strands of the 
30 resulting triple helix. The pyrimidine-rich molecules provide base complementarity to a purine- 
rich region of a single strand of the duplex in a parallel orientation to that strand. In addition, 
nucleic acid molecules may be chosen that are purine-rich, for example, containing a stretch of G 
residues. These molecules will form a triple helix with a DNA duplex that is rich in GC pairs, in 
which the majority of the purine residues are located on a single strand of the targeted duplex, 
35 resulting in GGC triplets across the three strands in the triplex. 
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5 Alternatively, the potential sequences that can be targeted for triple helix formation may 

be increased by creating a so called "switchback" nucleic acid molecule. Switchback molecules 
are synthesized in an alternating 5-3', 3 -5 'manner, such that they base pair with first one strand 
of a duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
pyrimidines to be present on one strand of a duplex. 
10 It is possible that the antisense, ribozyme, and/or triple helix molecules described herein 

may reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of 
mRNA produced by both normal and mutant target gene alleles. In order to ensure that 
substantially normal levels of target gene activity are maintained, nucleic acid molecules that 
encode and express target gene polypeptides exhibiting normal activity may be introduced into 
15 . cells that do not contain sequences susceptible to whatever antisense, ribozyme, or triple helix 

:J| treatments are being utilized. Alternatively, it may be preferable to coadminister normal target 

n gene protein into the cell or tissue in order to maintain the requisite level of cellular or tissue 

f 8 . target gene activity. 

m Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may be 

20 prepared by any method known in the art for the synthesis of DNA and RNA molecules. These 
[ w * include techniques for chemically synthesizing oligodeoxyribonucleotides and 
M oligoribonucleotides well known in the art such as for example solid phase phosphoramidite 
pi chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo 
transcription of DNA sequences encoding the antisense RNA molecule. Such DNA sequences 
25 may be incorporated into a wide variety of vectors which incorporate suitable RNA polymerase 
promoters such as the T7 or SP6 polymerase promoters. Alternatively, antisense cDNA 
constructs that synthesize antisense RNA constitutively or inducibly, depending on the promoter 
used, can be introduced stably into cell lines. 

Various well-known modifications to the DNA molecules may be introduced as a means 
30 of increasing intracellular stability and half-life. Possible modifications include but are not 

limited to the addition of flanking sequences of ribonucleotides or deoxyribonucleotides to the 5' 
and/or 3' ends of the molecule or the use of phosphorothioate or 2' O-methyl rather than 
phosphodiesterase linkages within the oligodeoxyribonucleotide backbone. 

Antibodies that are both specific for target gene protein, and in particular, mutant gene 
35 protein, and interfere with its activity may be used to inhibit mutant target gene function. Such 
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5 antibodies may be generated against the proteins themselves or against peptides corresponding to 
portions of the proteins using standard techniques known in the art and as also described herein. 
Such antibodies include but are not limited to polyclonal, monoclonal, Fab fragments, single 
chain antibodies, chimeric antibodies, etc. 

In instances where the target gene protein is intracellular and whole antibodies are used, 
10 internalizing antibodies may be preferred. However, lipofectin liposomes may be used to deliver 
the antibody or a fragment of the Fab region which binds to the target gene epitope into cells. 
Where fragments of the antibody are used, the smallest inhibitory fragment which binds to the 
target or expanded target protein's binding domain is preferred. For example, peptides having an 
amino acid sequence corresponding to the domain of the variable region of the antibody that 
binds to the target gene protein may be used. Such peptides may be synthesized chemically or 
'il produced via recombinant DNA technology using methods well known in the art (see, e.g., 

Creighton, Proteins: Structures and Molecular Principles (1984) W.H. Freeman, New York 1983, 
: ^ supra; and Sambrook, et al, 1989, supra). Alternatively, single chain neutralizing antibodies 
W which bind to intracellular target gene epitopes may also be administered. Such single chain 
2Q antibodies may be administered, for example, by expressing nucleotide sequences encoding 
!y* single-chain antibodies within the target cell population by utilizing, for example, techniques 
M such as those described in Marasco, et al, Proc. Natl Acad. Sci. USA, 90:7889-93 (1993). 
/-I RNA sequences encoding target gene protein may be directly administered to a patient 

^ s exhibiting disease symptoms, at a concentration sufficient to produce a level of target gene 
25 protein such that disease symptoms are ameliorated. Patients may be treated by gene 

replacement therapy. One or more copies of a normal target gene, or a portion of the gene that 
directs the production of a normal target gene protein with target gene function, may be inserted 
into cells using vectors which include, but are not limited to adenovirus, adeno-associated virus, 
and retrovirus vectors, in addition to other particles that introduce DNA into cells, such as 
30 liposomes. Additionally, techniques such as those described above may be utilized for the 
introduction of normal target gene sequences into human cells. 

Cells, preferably, autologous cells, containing normal target gene expressing gene 
sequences may then be introduced or reintroduced into the patient at positions which allow for 
the amelioration of disease symptoms. 
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Pharmaceutical Compositions, Effective Dosages, and Routes of Administration 

The identified compounds that inhibit target mutant gene expression, synthesis and/or 
activity can be administered to a patient at therapeutically effective doses to treat or ameliorate 
10 the disease. A therapeutically effective dose refers to that amount of the compound sufficient to 
result in amelioration of symptoms of the disease. 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the 
LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective 
15 in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic 
index and it can be expressed as the ratio LD50/ED50. Compounds which exhibit large therapeutic 
W indices are preferred. While compounds that exhibit toxic side effects may be used, care should 
L 7 =~ be taken to design a delivery system that targets such compounds to the site of affected tissue in 
^ order to minimize potential damage to uninfected cells and, thereby, reduce side effects. 
2(H' The data obtained from the cell culture assays and animal studies can be used in 

Q formulating a range of dosage for use in humans. The dosage of such compounds lies preferably 

within a range of circulating concentrations that include the ED50 with little or no toxicity. The 
OJ dosage may vary within this range depending upon the dosage form employed and the route of 
administration utilized. For any compound used in the method of the invention, the 
25 therapeutically effective dose can be estimated initially from cell culture assays, A dose may be 
formulated in animal models to achieve a circulating plasma concentration range that includes 
the IC50 (Le., the concentration of the test compound which achieves a half-maximal inhibition of 
symptoms) as determined in cell culture. Such information can be used to more accurately 
determine useful doses in humans. Levels in plasma may be measured, for example, by high 
30 performance liquid chromatography. 

Pharmaceutical compositions for use in accordance with the present invention may be 
formulated in conventional manner using one or more physiologically acceptable carriers or 
excipients. Thus, the compounds and their physiologically acceptable salts and solvates may be 
formulated for administration by inhalation or insufflation (either through the mouth or the nose) 
35 or oral, buccal, parenteral, topical, subcutaneous, intraperitoneal, intraveneous, intrapleural, 
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5 intraoccular, intraarterial, or rectal administration. It is also contemplated that pharmaceutical 
compositions may be administered with other products that potentiate the activity of the 
compound and optionally, may include other therapeutic ingredients. 

For oral administration, the pharmaceutical compositions may take the form of, for 
example, tablets or capsules prepared by conventional means with pharmaceutically acceptable 
10 excipients such as binding agents (e.g., pregelatinised maize starch, polyvinylpyrrolidone or 
hydroxypropyl methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium 
hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); disintegrants (e.g., 
potato starch or sodium starch glycolate); or wetting agents (e.g., sodium lauryl sulphate). The 
tablets may be coated by methods well known in the art. Liquid preparations for oral 
l£ % administration may take the form of, for example, solutions, syrups or suspensions, or they may 
41 be presented as a dry product for constitution with water or other suitable vehicle before use. 
ul Such liquid preparations may be prepared by conventional means with pharmaceutically 
f" acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose derivatives or 
01 hydrogenated edible fats); emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles 
(e.g., almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., 
methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer 
1 ! « salts, flavoring, coloring and sweetening agents as appropriate. 

r;i Preparations for oral administration may be suitably formulated to give controlled release 

^ E of the active compound. 
25 For buccal administration the compositions may take the form of tablets or lozenges 

formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
30 dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 
other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. Capsules and cartridges of e.g. gelatin for use in 
an inhaler or insufflator may be formulated containing a powder mix of the compound and a 
suitable powder base such as lactose or starch. 
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5 The compounds may be formulated for parenteral administration by injection, e.g., by 

bolus injection or continuous infusion. Formulations for injection may be presented in unit 
dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing 

10 agents. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. Oral ingestion is possibly the easiest method of taking any medication. Such a route 

15 of administration, is generally simple and straightforward and is frequently the least inconvenient 

; ;r:; £ 

H i| or unpleasant route of administration from the patient's point of view. However, this involves 
l f[ passing the material through the stomach, which is a hostile environment for many materials, 
H including proteins and other biologically active compositions. As the acidic, hydrolytic and 
q\ proteolytic environment of the stomach has evolved efficiently to digest proteinaceous materials 
26"" into amino acids and oligopeptides for subsequent anabolism, it is hardly surprising that very 
Q little or any of a wide variety of biologically active proteinaceous material, if simply taken 

orally, would survive its passage through the stomach to be taken up by the body in the small 
\H intestine. The result, is that many proteinaceous medicaments must be taken in through another 
M method, such as parenterally, often by subcutaneous, intramuscular or intravenous injection. 
25 Pharmaceutical compositions may also include various buffers (e.g., Tris, acetate, 

phosphate), solubilizers (e.g., Tween, Polysorbate), carriers such as human serum albumin, 
preservatives (thimerosol, benzyl alcohol) and anti-oxidants such as ascorbic acid in order to 
stabilize pharmaceutical activity. The stabilizing agent may be a detergent, such as tween-20, 
tween-80, NP-40 or Triton X-100. EBP may also be incorporated into particulate preparations of 
30 polymeric compounds for controlled delivery to a patient over an extended period of time. A 
more extensive survey of components in pharmaceutical compositions is found in Remington's 
Pharmaceutical Sciences, 18th ed., A. R. Gennaro, ed., Mack Publishing, Easton, Pa. (1990). 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
35 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
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5 Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may for 
10 example comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
be accompanied by instructions for administration. 

Diagnostics 

A variety of methods may be employed to diagnose disease conditions associated with 
15 the target gene. Specifically, reagents may be used, for example, for the detection of the 
rj presence of target gene mutations, or the detection of either over or under expression of target 
|* gene mRNA. 

! ??fe According to the diagnostic and prognostic method of the present invention, alteration of 

yj the wild-type target gene locus is detected. In addition, the method can be performed by 
2ff\ detecting the wild-type target gene locus and confirming the lack of a predisposition or 
^ neoplasia. "Alteration of a wild-type gene" encompasses all forms of mutations including 
; s d deletions, insertions and point mutations in the coding and noncoding regions. Deletions may be 
% s% of the entire gene or only a portion of the gene. Point mutations may result in stop codons, 
P frameshift mutations or amino acid substitutions. Somatic mutations are those which occur only 
25 in certain tissues, e.g., in the tumor tissue, and are not inherited in the germline. Germline 
mutations can be found in any of a body's tissues and are inherited. If only a single allele is 
somatically mutated, an early neoplastic state is indicated. However, if both alleles are mutated, 
then a late neoplastic state may be indicated. The finding of gene mutations thus provides both 
diagnostic and prognostic information. A target gene allele which is not deleted (e.g., that found 
30 on the sister chromosome to a chromosome carrying a target gene deletion) can be screened for 
other mutations, such as insertions, small deletions, and point mutations. Mutations found in 
tumor tissues may be linked to decreased expression of the target gene product. However, 
mutations leading to non-functional gene products may also be linked to a cancerous state. Point 
mutational events may occur in regulatory regions, such as in the promoter of the gene, leading 
35 to loss or diminution of expression of the mRNA. Point mutations may also abolish proper RNA 
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5 processing, leading to loss of expression of the target gene product, or a decrease in mRNA 
stability or translation efficiency. 

One test available for detecting mutations in a candidate locus is to directly compare 
genomic target sequences from cancer patients with those from a control population. 
Alternatively, one could sequence messenger RNA after amplification, e.g., by PCR, thereby 

10 eliminating the necessity of determining the exon structure of the candidate gene. Mutations 
from cancer patients falling outside the coding region of the target gene can be detected by 
examining the non-coding regions, such as introns and regulatory sequences near or within the 
target gene. An early indication that mutations in noncoding regions are important may come 
from Northern blot experiments that reveal messenger RNA molecules of abnormal size or 

15 abundance in cancer patients as compared to control individuals. 

L ;J The methods described herein may be performed, for example, by utilizing pre-packaged 

□j diagnostic kits comprising at least one specific gene nucleic acid or anti-gene antibody reagent 
j.^ described herein, which may be conveniently used, e.g., in clinical settings, to diagnose patients 
exhibiting disease symptoms or at risk for developing disease. 
2<^ Any cell type or tissue, preferably monocytes, endothelial cells, or smooth muscle cells, 

n| in which the gene is expressed may be utilized in the diagnostics described below. 
M: - DNA or RNA from the cell type or tissue to be analyzed may easily be isolated using 

ill procedures which are well known to those in the art. Diagnostic procedures may also be 
£*[ performed in situ directly upon tissue sections (fixed and/or frozen) of patient tissue obtained 
25 from biopsies or resections, such that no nucleic acid purification is necessary. Nucleic acid 
reagents may be used as probes and/or primers for such in situ procedures (see, for example, 
Nuovo, PCR In Situ Hybridization: Protocols and Applications, Raven Press, N.Y. (1992)). 

Gene nucleotide sequences, either RNA or DNA, may, for example, be used in 
hybridization or amplification assays of biological samples to detect disease-related gene 
30 structures and expression. Such assays may include, but are not limited to, Southern or Northern 
analyses, restriction fragment length polymorphism assays, single stranded conformational 
polymorphism analyses, in situ hybridization assays, and polymerase chain reaction analyses. 
Such analyses may reveal both quantitative aspects of the expression pattern of the gene, and 
qualitative aspects of the gene expression and/or gene composition. That is, such aspects may 
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5 include, for example, point mutations, insertions, deletions, chromosomal rearrangements, and/or 
activation or inactivation of gene expression. 

Preferred diagnostic methods for the detection of gene-specific nucleic acid molecules 
may involve for example, contacting and incubating nucleic acids, derived from the cell type or 
tissue being analyzed, with one or more labeled nucleic acid reagents under conditions favorable 
10 for the specific annealing of these reagents to their complementary sequences within the nucleic 
acid molecule of interest. Preferably, the lengths of these nucleic acid reagents are at least 9 to 
30 nucleotides. After incubation, all non-annealed nucleic acids are removed from the nucleic 
acid:fingerprint molecule hybrid. The presence of nucleic acids from the fingerprint tissue which 
have hybridized, if any such molecules exist, is then detected. Using such a detection scheme, 
15 the nucleic acid from the tissue or cell type of interest may be immobilized, for example, to a 
c| solid support such as a membrane, or a plastic surface such as that on a microtitre plate or 
!S% polystyrene beads. In this case, after incubation, non-annealed, labeled nucleic acid reagents are 
r h easily removed. Detection of the remaining, annealed, labeled nucleic acid reagents is 
accomplished using standard techniques well-known to those in the art. 

ill 

2(T J Alternative diagnostic methods for the detection of gene-specific nucleic acid molecules 

^ may involve their amplification, e.g., by PCR (the experimental embodiment set forth in Mullis 
uj U.S. Pat. No. 4,683,202 (1987)), ligase chain reaction (Barany, Proc. Natl Acad. Sci. USA, 
C 88:189-93 (1991)), self sustained sequence replication (Guatelli, et ah, Proc. Natl Acad. Sci. 
Q USA, 87:1874-78 (1990)), transcriptional amplification system (Kwoh, et al, Proc. Natl Acad. 
25~" Sci USA, 86:1173-77 (1989)), Q-Beta Replicase (Lizardi et al, Bio/Technology, 6:1197 (1988)), 
or any other nucleic acid amplification method, followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. These detection schemes are 
especially useful for the detection of nucleic acid molecules if such molecules are present in very 
low numbers. 

30 In one embodiment of such a detection scheme, a cDNA molecule is obtained from an 

RNA molecule of interest (e.g., by reverse transcription of the RNA molecule into cDNA). Cell 
types or tissues from which such RNA may be isolated include any tissue in which wild type 
fingerprint gene is known to be expressed, including, but not limited, to monocytes, endothelium, 
and/or smooth muscle. A sequence within the cDNA is then used as the template for a nucleic 

35 acid amplification reaction, such as a PCR amplification reaction, or the like. The nucleic acid 
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5 reagents used as synthesis initiation reagents (e.g., primers) in the reverse transcription and 

nucleic acid amplification steps of this method may be chosen from among the gene nucleic acid 
reagents described herein. The preferred lengths of such nucleic acid reagents are at least 15-30 
nucleotides. For detection of the amplified product, the nucleic acid amplification may be 
performed using radioactively or non-radioactively labeled nucleotides. Alternatively, enough 
10 amplified product may be made such that the product may be visualized by standard ethidium 
bromide staining or by utilizing any other suitable nucleic acid staining method. 

Antibodies directed against wild type or mutant gene peptides may also be used as 
disease diagnostics and prognostics. Such diagnostic methods, may be used to detect 
abnormalities in the level of gene protein expression, or abnormalities in the structure and/or 
15 tissue, cellular, or subcellular location of fingerprint gene protein. Structural differences may 

Q include, for example, differences in the size, electronegativity, or antigenicity of the mutant 

i?4 fingerprint gene protein relative to the normal fingerprint gene protein. 

j ?sfe Protein from the tissue or cell type to be analyzed may easily be detected or isolated 

UJ using techniques which are well known to those of skill in the art, including but not limited to 
2(£ ^ western blot analysis. For a detailed explanation of methods for carrying out western blot 

;;L analysis, see Sambrook, et al. (1989) supra, at Chapter 18. The protein detection and isolation 

UJ methods employed herein may also be such as those described in Harlow and Lane, for example, 
(Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 

O New York (1988)). 

25 Preferred diagnostic methods for the detection of wild type or mutant gene peptide 

molecules may involve, for example, immunoassays wherein fingerprint gene peptides are 
detected by their interaction with an anti -fingerprint gene-specific peptide antibody. 

For example, antibodies, or fragments of antibodies useful in the present invention may 
be used to quantitatively or qualitatively detect the presence of wild type or mutant gene 

30 peptides. This can be accomplished, for example, by immunofluorescence techniques employing 
a fluorescently labeled antibody (see below) coupled with light microscopic, flow cytometric, or 
fluorimetric detection. Such techniques are especially preferred if the fingerprint gene peptides 
are expressed on the cell surface. 

The antibodies (or fragments thereof) useful in the present invention may, additionally, 

35 be employed histologically, as in immunofluorescence or immunoelectron microscopy, for in 
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5 situ detection of fingerprint gene peptides. In situ detection may be accomplished by removing a 
histological specimen from a patient, and applying thereto a labeled antibody of the present 
invention. The antibody (or fragment) is preferably applied by overlaying the labeled antibody 
(or fragment) onto a biological sample. Through the use of such a procedure, it is possible to 
determine not only the presence of the fingerprint gene peptides, but also their distribution in the 
10 examined tissue. Using the present invention, those of ordinary skill will readily perceive that 
any of a wide variety of histological methods (such as staining procedures) can be modified in 
order to achieve such in situ detection. 

Immunoassays for wild type, mutant, or expanded fingerprint gene peptides typically 
comprise incubating a biological sample, such as a biological fluid, a tissue extract, freshly 
15 harvested cells, or cells which have been incubated in tissue culture, in the presence of a 
Cl detectably labeled antibody capable of identifying fingerprint gene peptides, and detecting the 
fv| bound antibody by any of a number of techniques well known in the art. 
\ H The biological sample may be brought in contact with and immobilized onto a solid 

■Ml phase support or carrier such as nitrocellulose, or other solid support which is capable of 
2Q,j= immobilizing cells, cell particles or soluble proteins. The support may then be washed with 
% suitable buffers followed by treatment with the detectably labeled gene-specific antibody. The 
Ml solid phase support may then be washed with the buffer a second time to remove unbound 
□l antibody. The amount of bound label on solid support may then be detected by conventional 
M means. 

25 By "solid phase support or carrier" is intended any support capable of binding an antigen 

or an antibody. Well-known supports or carriers include glass, polystyrene, polypropylene, 
polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, 
gabbros, and magnetite. The nature of the carrier can be either soluble to some extent or 
insoluble for the purposes of the present invention. The support material may have virtually any 

30 possible structural configuration so long as the coupled molecule is capable of binding to an 
antigen or antibody. Thus, the support configuration may be spherical, as in a bead, or 
cylindrical, as in the inside surface of a test tube, or the external surface of a rod. Alternatively, 
the surface may be flat such as a sheet, test strip, etc. Preferred supports include polystyrene 
beads. Those skilled in the art will know many other suitable carriers for binding antibody or 

35 antigen, or will be able to ascertain the same by use of routine experimentation. 
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5 The binding activity of a given lot of anti-wild type or -mutant fingerprint gene peptide 

antibody may be determined according to well known methods. Those skilled in the art will be 
able to determine operative and optimal assay conditions for each determination by employing 
routine experimentation. 

One of the ways in which the gene peptide-specific antibody can be detectably labeled is 
10 by linking the same to an enzyme and using it in an enzyme immunoassay (EIA) (Voller, Ric 
Clin Lab, 8:289-98 (1978) ["The Enzyme Linked Immunosorbent Assay (ELISA)", Diagnostic 
Horizons 2:1-7, 1978, Microbiological Associates Quarterly Publication, Walkersville, Md.]; 
Voller, et al, J- Clin. Pathol, 31:507-20 (1978); Butler, Metk EnzymoL, 73:482-523 (1981); 
Maggio (ed.), Enzyme Immunoassay, CRC Press, Boca Raton, Fla. (1980); Ishikawa, et al, 
15 (eds.) Enzyme Immunoassay, Igaku-Shoin, Tokyo (1981)). The enzyme which is bound to the 
_i« % antibody will react with an appropriate substrate, preferably a chromogenic substrate, in such a 
manner as to produce a chemical moiety which can be detected, for example, by 
spectrophotometry, fluorimetric or by visual means. Enzymes which can be used to detectably 
|,i j label the antibody include, but are not limited to, malate dehydrogenase, staphylococcal 
2&** nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate, 
r- dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, 
hj asparaginase, glucose oxidase, beta-galaetosidase, ribonuclease, urease, catalase, glucose-6- 

phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be 
Q accomplished by colorimetric methods which employ a chromogenic substrate for the enzyme. 
25 Detection may also be accomplished by visual comparison of the extent of enzymatic reaction of 
a substrate in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other immunoassays. For 
example, by radioactively labeling the antibodies or antibody fragments, it is possible to detect 
fingerprint gene wild type, mutant, or expanded peptides through the use of a radioimmunoassay 
30 (RIA) (see, e.g., Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on 
Radioligand Assay Techniques, The Endocrine Society, March, 1986). The radioactive isotope 
can be detected by such means as the use of a gamma counter or a scintillation counter or by 
autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the 
35 fluorescently labeled antibody is exposed to light of the proper wave length, its presence can then 
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5 be detected due to fluorescence. Among the most commonly used fluorescent labeling 
compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, 
allophycocyanin, o-phthaldehyde and fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting metals such as 
152Eu, or others of the lanthanide series. These metals can be attached to the antibody using 
10 such metal chelating groups as diethylenetriaminepentacetic acid (DTP A) or ethylenediamine- 
tetraacetic acid (EDTA). 

The antibody also can be detectably labeled by coupling it to a chemiluminescent 
compound. The presence of the chemiluminescent-tagged antibody is then determined by 
detecting the presence of luminescence that arises during the course of a chemical reaction. 
15 Examples of particularly useful chemiluminescent labeling compounds are luminol, isoluminol, 
theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label the antibody of the present 
W* invention. Bioluminescence is a type of chemiluminescence found in biological systems in, 
j ,| which a catalytic protein increases the efficiency of the chemiluminescent reaction. The 
2<p* presence of a bioluminescent protein is determined by detecting the presence of luminescence. 
is Important bioluminescent compounds for purposes of labeling are luciferin, luciferase and 
aequorin. 

Q Throughout this application, various publications, patents and published patent 

25 ; applications are referred to by an identifying citation. The disclosures of these publications, 

patents and published patent specifications referenced in this application are hereby incorporated 
by reference into the present disclosure to more fully describe the state of the art to which this 
invention pertains. 

The following examples are intended only to illustrate the present invention and should in 
30 no way be construed as limiting the subject invention. 

Examples 

Example 1: Direct Construct Construction from a Plasmid Library 

Genomic libraries using the lambda ZAP™ system were prepared as follows. Embryonic 
35 stem cells were grown in 100 mm tissue culture plates. High molecular weight genomic DNA 
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5 was isolated from these ES cells by adding 5 ml of lysis buffer (10 mM Tris-HCL pH7.5, 10 mM 
EDTA pH 8.0, 10 mM NaCl, 0.5% SDS, and 1 mg/ml Proteinase K) to a confluent 100 mm plate 
of embryonic stem cells. The cells were then incubated at 60°C for several hours or until fully 
lysed. Genomic DNA was purified from the lysed cells by several rounds of gentle 
phenolxhloroform extractions followed by ethanol precipitation. 

10 The genomic DNA was partially digested with the restriction enzyme Sau 3A I to 

generate fragments of approximately 5-20 kb. The ends of these fragments were partially filled 
in by addition of dATP and dGTP in the present of Klenow DNA polymerase, creating 
incompatible ends on the genomic fragments. Size fragments of between 5 and 10 kb were then 
purified by agarose gel electrophoresis (lx TAE, 0.8% gel). The DNA was then isolated from 

15 the excised agarose pieces using a QIAquick gel extraction kit (Qiagen, Inc., Valencia, CA). 
CI The genomic fragments were ligated into the Lambda Zap™ II vector (Stratagene, Inc., 

m La Jolla, CA) that had been cut with Xho I and partially filled in using dTTP, dCTP, and Klenow 

DNA polymerase. After ligation, the DNA was packaged using a lambda packaging mix 
Ml (Gigapack III gold, Stratagene, Inc., La Jolla, CA) and the titer was determined. 

20- ? l Circular phagemid DNA was derived from the lambda library by growing the lambda 

clones on the appropriate bacterial strain (XL-1 Blue MRF 1 , Stratagene, Inc.) in the presence of 
Ml the Ml 3 helper phage, ExAssist (Stratagene, Inc.). Specifically, approximately 100,000 lambda 
'q\ clones were incubated with a 10-100 fold excess of both bacteria and helper phage for 20 
!"-? minutes at 37°C. One ml of LB media + 10 mM MgSCU was added to each excision reaction and 

25 it was incubated overnight at 37 °C with shaking. Typically 24-96 of these reactions were set up 
at a time in a 96 well deep- well block. The following morning, the block was heated to 65 °C for 
15 minutes to kill both the bacteria and the lambda phage. Bacterial debris was removed by 
centrifugation at approximately 3000g for 15 minutes. The supernatant containing the circular 
phagemid DNA, was retained and used directly in plasmid PCR . 

30 The pools of phagemid DNA described above were screened for specific genes of interest 

using long-range PCR and "outward pointing" oligos, chosen as described above based on the 
known sequence (depicted in Figure 1). The PCR reactions contains 2 |Al of a pool phagemid 
DNA sample, 3 |xl of lOx PCR Buffer 3 (Boehringer Mannheim), 1.1 jd 10 mM dNTPs, 50 nM 
primers, 0.3 jlxI of EXPAND Long Template PCR Enzyme Mix (Boehringer-Mannheim) and 

35 30 \il of H 2 0. Cycling conditions were 94°C for 2 minutes (1 cycle); 94°C for 10 seconds, 65°C 
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5 for 30 seconds, 68°C for 15 seconds (15 cycles); 94°C for 10 seconds, 60°C for 30 seconds, 
68°C for 15 seconds plus 20 seconds increase per each additional cycle (25 cycles); 68°C for 7 
minutes (1 cycle) and holding at 4°C. 

The products of the PCR reactions were separated by electrophoresis through agarose 
gels containing IX TAE buffer and visualized with ethidium bromide and UV light. Any large 
10 fragments indicative of successful long-range PCR were excised from the gel and purified using 
QIAquick PCR purification kit (Qiagen). 

In order to eliminate the need to restriction map the PCR fragments, the following 
ligation-independent cloning strategy was employed. The long-range PCR fragment of interest 
was "purified" using a QIAquick PCR purification kit (Qiagen, Inc., Santa Clarita, California). 
15 Single-stranded ends of the PCR fragments were generated by mixing: 0. 1-2 |ug of the fragment; 
j «| 2 pi of NEB (New England BioLabs) Buffer 4; 1 jutl of 2 mM dTTP, 6 units of T4 DNA 
13 J polymerase (NEB), H 2 0 to total volume of 20 |il and incubating at 25°C for 30 minutes. The 
U polymerase was inactivated by heating at 75°C for 20 minutes. Single-stranded ends were also 
created on the Neo selectable marker fragment by digesting the plasmid vector pDG2 at the 
20" * unique restriction sites, with Sac I and Sac II (pDG2 depicted in Figure 2 A) and treating each 
q reaction with T4 DNA polymerase as above. The vector shown in Figure 1 was prepared with 
; s! f single-stranded ends complementary to those on the long-range PCR fragment. 
;!n The vector and fragments were then assembled into constructs using either a two-step 

.!."] 

ill cloning strategy or a four-way, single-step protocol. Briefly, a reaction containing 10 ng of T4- 
25 treated Neo 1 * cassette, 1 \xl of T4-treated PCR fragment, 0.2 |Lil of 0.5 M EDTA, 0.3 |il of 0.5 M 
NaCl and H2O up to 4 jal was heated to 65 °C and allowed to cool to room temperature over 
approximately 45 minutes. The mixture was then transformed into subcloning efficiency DH5-a 
competent cells. 

30 Example 2: Generation of Constructs from Phage Libraries 

A mouse embryonic stem cell library was prepared in lambda phage as follows. 
Genomic libraries were constructed from genomic DNA by partial cleavage of DNA at Sau 3AI 
sites to yield genomic fragments of approximately 20 kb in length. The terminal sequences of 
these DNA fragments were partially filled in using Klenow enzyme in the presence of dGTP and 
35 dATP and the fragments were ligated using T4 DNA ligase into Xho I sites of an appropriate 
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5 lambda cloning vector, e.g., lambda Fix E (Stratagene, Inc., La Jolla, California), which had 
been partially filled in using Klenow in the presence of dTTP and dCTP. Alternatively, the 
partially digested genomic DNA was size selected using a sucrose gradient and sequences of 
approximately 20 kb selected for. The enriched fraction was cloned into a Bam HI cut lambda 
vector, e.g., lambda Datsh II (Stratagene, Inc., La Jolla, California). 

10 The library was plated onto 1,152 plates, each plate containing approximately 1,000 

clones. Thus, a total of 1.1 million clones (the equivalent of 8 genomes) was plated. 

The phage were eluted from each plate by adding 4 ml of lambda elution buffer (10 mM 
MgCl2, 10 mM Tris-pH 8.0) to each plate and incubating for 3 to 5 hours at room temperature. 
After incubation, 2 ml of buffer was collected from each plate and placed into one well of a 96 

15 deep well plate (Costar, In.). Twelve 96-well plates were filled and referred to as the "sub-pool 
t;I | library." 

Using the sub-pool library, "pool libraries" were made by placing 100 \x\ of 12 different 
; ¥M sub-pool wells into one well of a new 96 well plate. The 12 sub-pool plates were combined to 
U f form 1 plate of pool libraries. 
2(L Using a pair of oligonucleotides that were known to PCR-amplify the gene of interest, 

;! 8i supernatant from the 96 pools of the "large-pool library" were amplified. PCR was performed in 
UJ the presence of 0.5 units of Amplitaq Gold™ (Perkin Elmer), 1 [iM of each oligonucleotide, 200 
r|l |LiM dNTPs, 2 ill of a 1 to 5 dilution of the pool (or subpool) supernatant, 50 mM KC1, 100 mM 
!;;[ Tris-HCl (pH 8.3), and either 1.5 mM or 1.25 mM MgCl 2 . Cycling conditions were 95°C for 8 
25 minutes (1 cycle); 95°C for 30 seconds, 60°C for 30 seconds, 72°C for 45 seconds (55 cycles); 
72°C for 7 minutes (1 cycle) and holding at 4°C. Depending on the gene, between about 3 and 
12 pool yielded positive signals as identified on agarose gels as described in Example 1. In cases 
where further purification was necessary (Le. where a clear signal was not present after 
amplification), the 12 sub-pools making up the pool were subjected to amplification using the 
30 same primers and a single sub-pool (1000 clones) was identified. 

Generation of flanking fragments. As described above, knock-out constructs contain two 
blocks of DNA sequence homologous to the target gene, flanking a positive selection marker. 
Long-range PCR was performed from the pools of lambda clones positively identified as 
described above in Example 2. Each fragment was generated using a pair of oligonucleotides 
35 with predetermined sequences lacking one type of base and complementary to predetermined 
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5 sequences on the vector. The fragments obtained were between 1 and 5 kb. A third fragment, 
longer than 5 kb, is also generated using appropriate oligonucleotides. This third fragment was 
then used to obtain DNA sequences near the gene to be knocked out but outside of the vector. 

Example 3: Two-Step Cloning- General Procedure 

10 The pDG2 plasmid vector (Figure 2A) contains unique restriction sites Sac II and Sac I 

Appropriate single-stranded annealing sites were generated by digesting the pDG2 vector with 
either restriction enzyme Sac II or Sac I and treating each reaction with T4 DNA polymerase and 
dTTP as described above. Four reactions were set up in microtitre plates for each vector, the 
reaction containing 1 |il of either (1) T4 DNA polymerase-treated fragments; (2) a 1:10 dilution 
15 of the T4-treated fragments reaction; (3) a 1: 100 dilution of the T4-treated fragments or (4) H2O 
(no insert control). The microtitre plates were sealed, placed in-between two temperature blocks 
heated to 65°C, and allowed to cool slowly at room temperature for 30 to 45 minutes. 
M The microtitre plate was then placed on ice and 20-25 fil of subcloning efficiency 

uj competent cells added to each well. The plate was incubated on ice for 20-30 minutes. The 
2(f* microtitre plate was then placed between two temperature blocks heated to 42°C for 2 minutes, 
^ followed by 2 minutes on ice. 100 (il of LB was added to each well, the plate covered with 
UJ parafilm and incubated 30-60 minutes at 37°C. The entire contents of each well were plated on 
'\ H% one LB-Amp plate and incubated at 37°C overnight. 

Q Between about 12-24 colonies were picked from plates which had at least 2-4 times more 

25 colonies than the no insert control. The colonies were grown in deep well plates overnight at 
37°C and then the plasmid DNA extracted using a Qiagen mini-prep kit. 

The plasmid DNA was digested with Not I and Sal I enzymes. As shown in Figure 2A, a 
Not I/Sal I digestion will generate a large fragment containing cloning sites 3 and 4 and a smaller 
fragment containing cloning sites 1 and 2 and the Neo r gene. After digestion, the reactions were 
30 run on a 0.8% agarose gel containing 0.2 fxg/ml ethidium bromide. For no inserts, two bands 
were present, one of 1975 base pairs and one of 2793 base pairs. When an insert fragment was 
present, at least one of these bands would be larger because it would also contain a fragment 
(insert 1 or 2) either at the annealing site 1/2 or the site 3/4. The insert bands were excised and 
treated with a QIAquick gel extraction kit. A second ligation reaction was performed containing 
35 1 jxl of 10X ligase buffer (50 mM Tris-HCI pH 7.5, 10 mM MgCl 2 , 10 mM dithiothreitol, 1 mM 
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5 ATP, 25 jag/ml bovine serum albumin), 1 |Lil T4 DNA ligase, 1-2 jjJ fragment (site 3/4 band), 5 \il 
of site 1/2 band and H2O up to 10 [il. Controls were also set up replacing either the site 3/4 
fragment or the site 1/2 fragment with water. The reactions were incubated 1 to 2 hours at room 
temperature and transformed with 25 jtil of competent cells. 

"Flanking DNA" in the context of these examples refers to the genomic sequences 

10 flanking the region in the target gene that is to be deleted or mutated. "Flanking DNA" is also 
described above as the blocks of DNA sequence homologous to the target gene. Rl genomic 
library refers to a genomic library prepared from the Rl ES cell line. Such libraries can be 
prepared such as described in Example L 

15 Example 4: Generation and Analysis of Mice Comprising Retina-Specific Nuclear 

(!S * Receptor Gene Disruptions 

To investigate the role of retina-specific nuclear receptors, disruptions in retina-specific 
^ nuclear receptor genes were produced by homologous recombination. Specifically, transgenic 
7Ml mice comprising disruptions in retina-specific nuclear receptor genes were created. More 
u ; particularly, a retina-specific targeting construct having the ability to disrupt or modify retina- 
! L specific nuclear receptor genes, specifically comprising SEQ ID NO: 19 was created using the 
m oligonucleotide sequences identified herein as SEQ ID NO:20 or SEQ ID NO:21. The targeting 
j*j5 construct was introduced into ES cells derived from the 129/Sv-+P+Mgf-SLJ/J mouse substrain 
25M to generate chimeric mice. Fl mice were generated by breeding with C57BL/6 females. F2 
homozygous mutant mice were produced by intercrossing Fl heterozygous males and females. 
The transgenic animals comprising disruptions in retina-specific nuclear receptor genes were 
analyzed for phenotypic changes and expression patterns. The phenotypes associated with a 
disruption in a retina-specific nuclear receptor gene were determined as follows: 
30 Homozygous Mice: 

The homozygous mice analyzed demonstrated at least one of the following phenotypes: 
Eyes, Eye abnormalities, including severe retinal dysplasia characterized by extensive 
rosette formation and retinal folding; segmental thinning of the outer nuclear layer of the retina 
with rods and cones filling the foci; and complete unilateral absence of the retina. Moreover, the 
35 space normally occupied by the retina was filled with fibrous connective tissue, spicules of 

osteoid and some mineral. In areas, connective tissue was adherent to the posterior lens capsule. 



59 



Posterior synechia with a thickened iris adherent to the anterior aspect of the lens was detected. 
The pigmented epithelial layer of the retina was thickened and its cells were increased in size and 
number. The internal structure of the lens was disorganized and comprised swollen and 
degenerated fibers. In instances where the retina was absent unilaterally, small focal remnants 
were present. 

Gastrointestinal tract. Abnormalities in the gastrointestinal tract included multifocal 
infiltrates of neutrophils in the deep mucosa and submucosa in the stomach. 

Skin, Abnormalities in the skin included focal lymphocytic inflammation within the 

dermis. 

Testes/Epididymides. Abnormalities in the testes and epididymides included reduced 
spermatogenesis. Specifically, seminiferous tubules had scattered degenerate or necrotic 
spermatogenetic epithelial cells and multinucleated giant cells. The epididymides had reduced 
number of spermatids, and degenerated cells were present in tubules. The epithelial cells of 
some epididymal tubules were vacuolated. 

Clinical Chemistry/Blood Analysis. Abnormalities included low alanine 
aminotransferase (ALT) values, aspartate aminotransferase (AST), and creatinine kinase (CK) 
values as compared to wild-type control values. Alkaline phosphatase (ALP) activity, however, 
was elevated. Hematological evaluation showed lower total white blood cell count. 
Heterozygous Mice: 

Skin. Abnormalities included local fibrosis and lymphocytic dermatitis. 

Liver. Abnormalities included pericholangitis with bile duct hyperplasia and fibrosis. 



As is apparent to one of skill in the art, various modifications of the above embodiments 
can be made without departing from the spirit and scope of this invention. These modifications 
and variations are within the scope of this invention. 
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